1. PREVIOUS MODELS OF LEARNING


1.1. Psychological and physiological models

Psychological models of learning behaviour normally describe what changes will 
occur as a result of a stipulated experience; they consider the ‘before’ and ‘after’ 
stationary states of the organism, and, on the whole, neglect the non stationary 
transition. It is true that statistical models (Estes, 1950; Bush and Mosteller, 1955; 
Siegal, 1959) make a broad comment upon the actual process, but their predictive 
value is confined to parameters such as the number of trials required to reach a given 
criterion of performance. Few of the psychological models propose a mechanism for 
learning. Those that do are either testable but restricted, e.g. Brown’s (1958) model 
for short term memory, Broadbent’s (1958) filter model or, as in the case of Hull (1952) 
and Eysenck (1956) there is some doubt about the status of the intervening variables 
that are introduced. Without exception, the testable models are open to the criticism 
that they deal with learning in very restricted conditions such as a Skinner box (1938) 
or a maze, and consequently, although they refer to adaptive behaviour amongst a 
specified set of alternatives their validity as models of learning depends upon the 
assumption that this facet of behaviour can be regarded as relevant and the rest of it 
(the maundering of the rat, the idle thoughts of the man) can be neglected. The 
evidence seems to contradict this supposition and (whilst it is difficult to offer an 
alternative) it is fair to say that this form of model refers to reinforced selection and 
completely omits the issue of attention. 

Physiological models (Pavlov, 1927; Sherrington, 1948; Hyden, 1960; Bok, 1961) 
rightly avoid commitment to learning as such and describe some process within the 
brain. Thus Pavlov was acutely aware of the difficulties of extending his conditioning 
models beyond the experimentally verified region. In general, with any physiological 
model, it is necessary to distinguish possibility from actuality with great care. A con- 
ditioning mechanism, for example, undoubtedly could account for adaptive behaviour; 
but that is different from saying it does account for this behaviour in every organism 
or upon every occasion. 


1.2. The approach of cybernetics 

Some cybernetic models are derived from a psychological root, for example, Rosen- 
blatt’s (1961) perceptron and George’s (1961) automata stem largely from Hebb’s 
(1949) theory. Others, such as Grey Walter’s (1953) and Angyan’s (1958) respective 
tortoises, have a broader behavioural antecedent. 

On the other hand, neurone models, like Harmon’s (1961) and Lettvin’s (1959), are 
based upon facts of microscopic physiology and have the same predictive power 
linked to the same restrictions as an overtly physiological construction. 
Next, there are models which start from a few physiological facts such as known
characteristics or connectivities of neurones and add to these certain cybernetically 
plausible assumptions. At a microscopic level, McCulloch’s (1960) work is the most 
explicit case of this technique (though it does not, in fact, refer to adaptation so much 
as to perception) for its assumptions stem from Boolean Logic (Rashevsky, 1960, 
describes a number of networks that are adaptive). Uttley (1956), using a different 
set of assumptions, considered the hypothesis that conditional probability compu- 
tation occurs extensively in the nervous system. At a macroscopic level, Beurle (1954) 
has constructed a statistical mechanical model involving a population of artificial 
neurones which has been successfully simulated, whilst Napalkov’s (1961) proposals 
lie between the microscopic and macroscopic extremes. 

Cyberneticians are naturally concerned with the logic of large systems and the logical 
calibre of the learning process. Thus Willis (1959) and Cameron (1960) point out the 
advantages and limitations of threshold logic. Papert (1960) considers the constraints 
imposed upon the adaptive process in a wholly arbitrary network, and Ivahnenko 
(1962) recently published a series of papers reconciling the presently opposed idea of 
the brain as an undifferentiated fully malleable system and as a well structured device 
that has a few adaptive parameters. MacKay (1951) has discussed the philosophy of 
learning as such, the implications of the word and the extent to which learning behav- 
iour can be simulated; in addition to which he has proposed a number of brain-like 
automata. But it is Ashby (1956) who takes the purely cybernetic approach to learning. 
Physiological mechanisms are shown to be special cases of completely general systems 
exhibiting principles such as homeostasis and dynamic stability. He considers the 
behaviour of these systems in different experimental conditions and displays such 
statements as ‘the system learns’ or ‘the system has a memory’ in their true colour 
as assertions that are made relative to a particular observer. 


2. THE CONCEPTION OF LEARNING 


2.1. Adaptive behaviour and learning


The adaptive behaviours called ‘learning’ are inadequately demarcated. As a matter 
of logic, ‘learning’ must be held apart from adaptation alone (although adaptation 
always accompanies ‘learning’), for without this distinction, most mechanistic
comments on the subject become vacuous. Ashby has shown that any large dynamic
assembly must adapt and retain some mark of its previous configuration. Hence, to 
say that a brain or a large artifact does exhibit this behaviour, is an undiscriminating 
and almost tautologous remark. By the same token, ‘memory’ (inferred from an ob- 
served learning behaviour) must be distinguished from ‘retention’. 

Another of Ashby’s results (1958) has demolished the once popular belief that 
‘learning’ is simply an adaptation directed to some objective, like sustenance of the 
learning organism. This view is far too naive; all stable systems are homeostatic in a 
tenable environment and all homeostasis can be credited with an objective. If the 
environment is perturbed in a particularly regular fashion the adaptive changes that 
maintain the homeostasis are called habituation, which is a predictable consequence of
elaborate structure. 


2.2. The implications of learning 

It is generally agreed that over and above directed adaptation learning entails the 
construction and appreciation of order amongst objects and their images. An organism 
that learns becomes able to deal coherently with its environment, or the flux of its 
autonomous activity, by imposing a plan upon otherwise haphazardly arranged con- 
stituents. Minsky and Selfridge (1960), in the idiom of ‘artificial intelligence’ require 
generalisation and the development of ‘a heuristic’ as indices of learning, whilst 
MacKay (1956) has maintained the very similar view, that learning of any kind entails 
building an hierarchy of languages or systems of signs and relations. The hierarchy is
realistic insofar as some translation can take place between these different levels of 
discourse. 


2.3. The criterion of identification 

For the present purpose we shall concentrate upon the act of identification, or in 
Carnap’s (1947) sense, an interpretation of some nonlogical sign in a calculus in terms 
of a designatum, as the crucial feature of learning (‘translation of a message between 
languages’ and ‘identification of a pair of otherwise incomparable sign sequences’ are 
operationally equivalent). Consequently an organism that learns must also be able to
identify relevant stimuli (or values of attributes in its environment), to bring its at- 
tention to bear upon a limited universe of discourse which includes these stimuli and 
the response alternatives amongst which adaptation occurs. Similarly, it must form 
and use named ‘concepts’ or generalisations, the crux of this process being identifi- 
cation between distinct concepts (called ‘insight’) rather than the aggregation of 
common features which is a relatively trivial result of constrained adaptation (notice 
that this kind of ‘identification’ is a ‘translation’). 


2.4. Restrictions imposed upon mechanisms and models of learning 

Once a more sophisticated view of learning has been accepted, we are forced to 
discard most of the neat and familiar representations of mental or neural activity. In 
particular, the restricted models of metricised information theory are inadequate 
vehicles for a description of any mechanism supposed to account for learning behaviour 
(Pask, 1958a,b). Of course, there is a sense in which information is communicated 
and transformed when an organism is learning but as Greene (1961) points out (when 
he remarks that percepts carry with them overtones of other messages) this is not the 
sense of any metricised information theory. McCulloch and MacKay (1952) said much 
the same when they discussed the status of ‘signals’ in a brain. Information is rep- 
resented in various ways, for example, in terms of impulse rate and impulse phase 
relation and neither an impulse nor any other event is unambiguously a signal. 


2.5. Finite automata

Now information theory is applicable within the framework of any one
representation. This representation defines a set of permissible outcomes and messages and
codes. Similarly, any one representation defines a set of abstract models which can 
be identified with a physical brain, by an observer who specifies his state description 
and forms a system in the sense of Ashby. Certain of these systems are isomorphic or 
homomorphic images of one another, and within such a well related set of systems there 
is a calculus equivalent to the calculus of information theory. A typical abstract model 
is a McCulloch and Pitts (1947) network of ‘artificial neurones’, a typical system, its 
identification with attributes of some part of a brain, and in this case the transformable 
set of systems is the set of (possibly restricted) finite automata. 

But there is no guarantee that information theory or any other calculus is applicable 
between representations (indeed we must assume the contrary). 


2.6. Need for many representations 

Now we have argued that learning is crucially dependent upon an act of identifi- 
cation, in other words, upon a relation between different representations of data. But 
in this case, the process of learning cannot be described, or at best, can rarely be 
described, in terms of a single information system. Normally it will be necessary for 
an observer who investigates learning to reconstruct his identification and remake his 
state description whenever the observed organism ‘changes its attention’, or exhibits 
‘insightful behaviour’ (that is, whenever it changes its identification). 

No single system can represent the mechanism that underlies learning. No finite 
automaton can be held to learn. A search for the mechanism of learning is either 
unduly parochial or misconceived for it is the activity of many systems, not one, that 
is correlated with learning behaviour (or, we can say, it is a single self organising 
system, as the phrase is used by Von Foerster, 1960, of which the separate systems 
are components, related by the appearance of learning and possibly in no other way). 


2.7. The field of enquiry 

It is important to recognize that these comments apply to a field of enquiry, namely 
‘learning behaviour’ and not to all facets of mental or neural activity. Thus Harmon 
(1959) is on firm enough ground when he says that impulses are the minimal com- 
ponents of messages in the language of neurones, simply because Harmon is not 
directly concerned with learning and is consequently able to consider events in a 
single language system. But since we are concerned with learning, the pertinent en- 
quiries refer to many language systems (and their structure is changed by the process 
being investigated). Thus we cannot afford such elegant and convenient models nor,
at the present state of knowledge, can assertions about learning be reduced, say, by
Nagel’s (1949) procedure, to expressions in the conceptual framework of neural 
networks. 

On the other hand, it would be just as absurd to suggest that networks and finite
automata are valueless in the role of explanatory devices as it would be to suggest 
that bond models of chemical molecules are valueless explanatory devices in reaction
chemistry. For whilst it is legitimate to remark that neurone networks (or bond 
models) are unable to answer our enquiries concerning the mechanisms of learning
(or the dynamics of a reaction), probabilistic combinations of either model have
predictive value. Their merit is conceptual simplicity. Their weak point is that given 
a probabilistic combination which seems to fit the facts, we are left with an indetermi- 
nacy* (Pask, 1962; Von Foerster and Pask, 1960/1961), or uncertainty regarding the 
form of organisation ultimately responsible for the observed behaviour. 

This indeterminacy comes from speaking in terms of one language about a construct 
that entails many languages, and it can only be removed by an act of identification, 
in particular, if the observer translates expressions from the language of network 
model or bond model descriptions, into the corresponding expressions in a more 
comprehensive language or metalanguage. 


3. EXPERIMENTAL VERIFICATION 


3.1. Difficulty over linguistic ideas 

In order to pass from abstract organisations to physical assemblies, like the brains 
of men, animals and artifacts, we shall construct several experimental tests for learning. 
The greatest difficulty at this stage, is confining our discussion to the subject of 
learning. In an attempt to avoid the whole gamut of linguistic behaviour, I may do 
violence to many linguistic ideas by inadvertent mistakes or by deliberate but ille- 
gitimate specialisation. 


3.2. Environments where learning occurs 

The primitive constituents of a test situation are an organism (said, after testing, 
to learn or not to learn), its environment of objects and actions and its observer.
Depending upon the extent to which he infers similarity with the organism, the 
observer may produce and respond to the actions and objects that constitute the 
environment. If the organism is a man, the observer is assumed to infer considerable 
similarity, if it is an animal or an artifact rather little similarity. But, in any case, 
the inference is made on grounds that are remote from the experimental evidence. 

There is always a sense in which the environment is commonly named by the 
organism and the observer. (1) Given a strong inference of similarity the set of possible
objects and actions is taken to be identified in substantially the same way by the
observer and the organism, hence to constitute, on the one hand, a set of stimuli (or,
in the case of verbal discourse, of words) and, on the other hand, a set of nameable response
alternatives, which may also be words or sequences of words. (2) Lacking a strong
inference of similarity the observer must be able, by previous observation or stimulus 
equivalence investigations, to describe the effective stimuli (and place them in corre- 
spondence with the objects and actions he perceives), and to describe the response 
alternatives for the organism. Thus Lettvin et al. (1959) described the perceptual
attribute space for the frog and described how a frog could respond. 


* The word ‘indeterminacy’ is used to distinguish this uncertainty from the more familiar classical 
uncertainty, given a set of relevant attributes defining a system, regarding the state of the system. 
Whereas a ‘classical’ uncertainty is reduced by making more microscopic observation, ‘indeterminacy’ 
is not. Although classical uncertainty may often perturb real observations of brains, it has little to 
do with the immediate discussion. 
We thus characterise the test environment by a set X of signs x for objects and
actions affecting the organism as its input, and a set Y of signs y for responsive actions
affecting objects in the environment, which constitute its output. Manifestly, if
selections from X and Y are to be signals representing information, neither X nor Y
can be unrestricted. Given this, we call X ∪ Y the finite vocabulary of an operational
language (that is, a system consisting of the signs in X or Y and relations that appear
as invariants of the organism’s behaviour), whereby the organism indulges in discourse
with its environment. Tentatively, the input signs x ∈ X will be identified with ‘subject’
words, the output signs y ∈ Y with ‘predicate’ words, and the organism itself with any
operator capable of performing the ‘and so’ operation of coherent discourse to produce
utterances (x, y) (in the sense of M. Mastermann, 1958). The least organism of this kind
(which features in our formal discussion) must contemplate or take into account at
least the possible pairs x, y of subjects and predicates when selecting some y from Y.

We should emphasize that any construction that involves organisms and operational 
languages is specified relative to a scientific or descriptive language A (used and compre- 
hended by a body of observers). There is no guarantee that the structure of the 
operational languages will be preserved if the descriptive language is changed, say 
from A → A*. It may be possible to represent systems of operational languages in a
manner that is invariant with transformations A → A*. The recent discussions of
cybernetic logic by Gunther (1962) indicate that this can be achieved, at any rate in 
principle. But, whilst admitting that relativism is unsatisfactory, we shall not discuss 
the issue any further in the present paper. 

Now a scientific language is a metalanguage, in terms of which an operational 
language is described. We shall concentrate upon those systems constructed in A,
within which it is possible to distinguish more than one operational language. 

Let us denote these operational languages α, β, and their finite vocabularies
X^α ∪ Y^α, X^β ∪ Y^β. Let permissible or sensible utterances have the form u = (x, y)
and denote the set of permissible α utterances U^α ⊂ X^α ⊗ Y^α and the set of
permissible β utterances U^β ⊂ X^β ⊗ Y^β.
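As a rough illustration of the definitions above, the following Python sketch builds the vocabulary of one operational language and checks that a set of permissible utterances lies inside the product of the input and output sign sets. The particular signs (`light`, `tone`, and so on) and the helper name `is_permissible_set` are assumptions made purely for the example; the paper does not specify any concrete vocabulary.

```python
from itertools import product

# Hypothetical sign sets for one operational language (illustrative only).
X_alpha = {"light", "tone"}          # input signs x ('subject' words)
Y_alpha = {"approach", "withdraw"}   # output signs y ('predicate' words)

# Finite vocabulary of the operational language: the union of X and Y.
vocabulary_alpha = X_alpha | Y_alpha

# Every candidate utterance (x, y) is a member of the product set of X and Y.
candidates = set(product(X_alpha, Y_alpha))

# Only some candidates are permissible; this selection is arbitrary.
U_alpha = {("light", "approach"), ("tone", "withdraw")}


def is_permissible_set(U, X, Y):
    """Return True if every utterance in U is a pair (x, y) with x in X, y in Y."""
    return U <= set(product(X, Y))
```

Here `is_permissible_set(U_alpha, X_alpha, Y_alpha)` holds, whereas a pair such as `("light", "light")` would be rejected because its second member is not an output sign.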

It is not too difficult to specify the conditions that must be satisfied if α and β are
to exist in our A description of a system if we conduct the discussion in rather vague 
terms. Hence, we shall give a broad account and become a little more precise, when 
we need to be, later (see Appendix). 

(1) If α and β are operational languages in A it must be possible to distinguish (also
in the A description of the system) distinct organisms capable of using these languages 
for communication. 

(2) These organisms must belong to at least a couple of different species (say the A
species of organisms and the B species of organisms), and communication must occur
between the members of each species of organisms; being characterised in the case
of α by utterances u ∈ U^α and in the case of β by utterances u ∈ U^β.

(3) In order that α and β be distinct, the A system of communication must be
distinct from the B system. The distinction could be secured in various ways of which 
only one will be cited. 

(4) (i) There must be at least some signs, used in the α discourse, which do not
appear in the β discourse and vice versa, hence (X^α ∪ Y^α) ∩ (X^β ∪ Y^β) ⊂ (X^α ∪ Y^α)
∪ (X^β ∪ Y^β). (ii) It will be convenient to assume that U^α and U^β are disjoint. Thus
[U^α ∩ U^β] = ∅ where ∅ denotes the empty set. (iii) The intention behind (i) is that
U^α and U^β are sets of utterances which enjoy the property, for any u_0 ∈ U^α, that ‘u_0
means the same thing to all organisms of the A species’ and similarly, for any u_1 ∈ U^β,
that ‘u_1 means the same thing to all organisms of the B species of organism’. (We shall
make the relation of ‘meaning the same thing’ more precise, later. For the moment,
assume that a given utterance ‘means the same thing’ to a pair of organisms if, when
received in the same conditions, it induces the same behaviour). (iv) Hence [U^α ∩ U^β]
= ∅ implies that there is no u which means the same thing to all A species organisms
and all B species organisms.

(5) This condition (which entails complete lack of relation between α and β with
respect to universally interpreted signs) is a shade more restrictive than necessary,
although it will be convenient to adopt it. But in any case we should insist that α
and β are neither isomorphic nor homomorphic images of one another when the
‘and so’ precedence of x, y is interpreted as a composition.

(6) However, there must be a possibility of ‘translation’ between α and β or vice
versa. We shall interpret a translation as a special transformation that has the index
of the particular organism that performs this translation as its parameter. For this
purpose, we introduce the concept of a limited area of ‘meaning’. An α sign may
mean the same thing to one member of the A species of organism as some sign or
some set or some sequence of β signs (but this cannot be a universal implication, true
for all members of the A species of organism). Similarly, translation can be employed
by B organisms to interpret α signs.

We shall argue that the possibility of ‘translation’ rests upon a similarity between 
the A species organisms and the B species organisms which exists even though they 
use distinct communication systems. (In our discussion we shall be chiefly concerned 
with similarities of physical structure or similarities in the relation between these 
organisms and a common environment, but there are many other possibilities. Thus 
a French and an Italian enjoy similar cultural backgrounds, and their discourse has 
a similar semantic content.) 

(7) The α operational language and the β operational language remain unrelated
with respect to the universally true assertions of the participating organisms. But they
are, of course, perfectly well related from the viewpoint either of the A observer or a
particular organism that identifies or translates a set of utterances, hence, according
to our criterion, an organism that learns.

The possible relations α, β and β, α are (1) that α and β characterise communication
systems capable of sustaining the same level of discourse or (2) that α is a
metalanguage with reference to the operational language β (and is thus capable of
describing some aspect of β) or, vice versa, that β is a metalanguage with reference to α.
If α is a metalanguage relative to β (or vice versa) the functions determining α
discourse are of higher order and propositions in α are of a higher logical type than
the corresponding logical functions and propositions in β; the hierarchy of orders or
types being determined by elementary propositions in A. To elaborate the point, α
signs which might, for example, include ‘furniture’ cannot be derived from β signs,
such as ‘table’ or ‘chair’. To say that ‘furniture’ cannot be derived from expressions
in β means (a) that the category of furniture regarded as a disjunctive concept in β
must be defined ostensively by exemplars. But the choice of these exemplars is
unlimited (we can only specify furniture as the particular tables and chairs and other
objects pointed out by a B organism). (b) If the category ‘furniture’ is regarded as a
conjunctive concept entailing those attributes common to tables and chairs, the choice
of attributes is unlimited (and we can only specify the particular subset selected by
this B organism).

(8) This organism specificity suggests that whatever the logical status of α and β
(from the A observer’s point of view) any participant B organism, performing a
translation from α to β, must use β as a metalanguage. For this B organism must construct
a description, based upon its experience, of the usage of α terms. Indeed we shall
argue that it must construct within itself an image of an A organism and look at the
environment in an A-like representation.
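Conditions (4) and (6) lend themselves to a small set-theoretic sketch. The Python below is only an illustration under assumed data: the sign sets, the organism labels `b1` and `b2`, and the function names are all hypothetical and are not taken from the paper. It checks that each vocabulary contains signs the other lacks, that the two utterance sets are disjoint, and that translation is indexed by the particular organism performing it, so that no translation holds universally across a species.

```python
# Hypothetical vocabularies and utterance sets for two operational languages.
X_a, Y_a = {"light", "tone"}, {"approach", "withdraw"}
X_b, Y_b = {"tone", "shock"}, {"withdraw", "freeze"}
U_a = {("light", "approach"), ("tone", "withdraw")}
U_b = {("shock", "freeze"), ("tone", "freeze")}


def vocabularies_distinct(X_a, Y_a, X_b, Y_b):
    """Condition (4)(i): each language uses some signs absent from the other."""
    V_a, V_b = X_a | Y_a, X_b | Y_b
    return bool(V_a - V_b) and bool(V_b - V_a)


def utterances_disjoint(U_a, U_b):
    """Condition (4)(ii): no utterance belongs to both languages."""
    return U_a.isdisjoint(U_b)


# Condition (6): a translation is parameterised by the organism performing it.
# Two hypothetical B organisms interpret the same utterance differently.
translations = {
    "b1": {("light", "approach"): ("tone", "freeze")},
    "b2": {("light", "approach"): ("shock", "freeze")},
}


def translate(organism, utterance):
    """Return this organism's rendering of the utterance, or None if it has none."""
    return translations[organism].get(utterance)


def is_universal(utterance, organisms):
    """True only if every listed organism translates the utterance identically."""
    images = {translate(o, utterance) for o in organisms}
    return len(images) == 1 and None not in images
```

With these data `is_universal(("light", "approach"), ["b1", "b2"])` is false, which is the sense in which a translation has a limited area of meaning rather than being true for every member of a species.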

In the limiting case, which we shall examine in connection with the mechanism of 
learning, the analogues of A and B organisms have a repertoire of utterances which 
completely specify their behaviours. These organisms could be manifest in no other 
way, and in order to exist, they must make precisely these utterances. A complete α
to β translation performed by a B organism thus amounts to constructing an
informational replica of an A organism in the medium of the B organism concerned. In
other words, an α → β translation amounts to reproducing an A organism. We shall
return to this point in a moment; its immediate relevance is that it demonstrates an 
identity between the chosen criterion of learning and ideas which appear in Wiener’s 
(1961) account of the reproductive process, which he is concerned with developing 
at the moment (Wiener, 1962). Specifically we shall identify learning with a cybernetic 
property of organisations that reproduce or evolve in media like brains and social 
aggregates and will argue that at least in certain special cases (Wiener’s concept applies 
to the general case of the ‘Black and White Box’ reproductive system) the fact of 
evolution (which entails some reproductive mechanism) will also entail the manifes- 
tation of learning behaviour. 


3.3. Tests for learning behaviour 

Learning occurs insofar as the system said to learn, is able to identify utterances or 
signs in its operational language with novel designata; if these designata are utterances 
in a different operational language, the organism can learn insofar as it can perform 
a translation. 

Tests of this ability may involve either control or communication and each kind of 
test is fairly commonly used. So far as control is concerned, the experimenter specifies
and the subject agrees or is known to accept an objective which can only be realised
either (i) by recognising attributes of events or patterns of distinct stimuli which have
not been encountered upon previous occasions (these are commonly regarded as tests
of perception or abstraction), or (ii) by constructing a novel categorisation of objects
in the environment (these are grouping tests). In either case the subject is required to
identify signs in his own language with signs in the language of a feedback loop,
stabilising the desired objective, which the experimenter has contrived in the environ- 
ment. Of course, only a particular form of learning is detected, namely a form which 
entails order relations that the experimenter can appreciate himself. 

So far as communication is concerned the test situation is normally an interview 
(a special case is considered by John Clark). We assume that any pair of men have 
different operational languages or equivalently different ‘ways of thinking’ which they 
prefer to use. (The test is rarely applied to animals because of obvious but not in- 
surmountable difficulties.) In conversation, ‘translation’ occurs in the sense that the 
compromise needed to maintain coherent discourse about an unfamiliar topic involves 
the construction of a language or ‘way of thinking’ that is commonly intelligible to 
the participants. Hence the interviewer who is anxious to evaluate learning endeavours 
to estimate the other participants’ ability to communicate in this situation, discounting, 
so far as possible, the effect of extraneous variables, such as ‘diffidence’ or emotional 
bias. The disadvantage of this method is that the observer who conducts the test is, 
himself, a participant and his evaluation must be translated into terms of A, and a 
rather similar criticism applies to the alternative technique of observing a man who 
is interacting with a group of other men (to assess the extent to which this individual 
makes himself understood and manages to understand the point of view adopted by 
the others). 


3.4. Systems capable of learning 

In order to exhibit ‘learning’ as a cybernetic property of organisations, it is neces- 
sary to represent the systems we consider in our A description in sufficiently abstract 
and general terms. For the present purpose the most convenient representation is 
Rosen’s (1958) wherein a system is conceived as a directed graph, such as illustrated 
in Fig. 1a.


Fig. 1a.


The directed edges of the graph correspond to mappings F, the nodes to the con- 
junction of the domain of a mapping and the ranges of some other mappings as in 
the Appendix. 

Now this representation can be used to depict either the physical embodiment of a 
communication system Σ, in which case the several mappings in the graph represent
physically distinct subsystems Σ_i and coupling functions between these subsystems;
or, alternatively, it may be used to depict the communication system as an information
structure Σ^α or Σ^β without any particular physical referent; in which case the mappings
are transformations of signs and the graph determines which sign transforming
subsystem Σ_i can provide an input to another subsystem Σ_j. Communication entails the
existence of cyclic subgraphs, like a and b in Fig. 1b.





Fig. 1b. 
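The requirement that communication entails cyclic subgraphs can be checked mechanically once a system graph is written down. The sketch below assumes a graph stored as a dictionary from each node to the list of nodes it feeds; the node names and the function `has_cycle` are illustrative assumptions, and the depth-first search is a standard technique rather than anything prescribed by the paper.

```python
def has_cycle(graph):
    """Return True if the directed graph (dict: node -> list of successors) has a cycle."""
    WHITE, GREY, BLACK = 0, 1, 2          # unvisited, on current path, finished
    colour = {node: WHITE for node in graph}

    def visit(node):
        colour[node] = GREY
        for nxt in graph.get(node, []):
            state = colour.get(nxt, WHITE)
            if state == GREY:             # edge back into the current path: a cycle
                return True
            if state == WHITE and visit(nxt):
                return True
        colour[node] = BLACK
        return False

    return any(colour[n] == WHITE and visit(n) for n in graph)


# A feed-forward chain carries signs one way only: no cyclic subgraph.
chain = {"s1": ["s2"], "s2": ["s3"], "s3": []}

# Two subsystems that feed one another form a cyclic subgraph.
loop = {"s1": ["s2"], "s2": ["s1"]}
```

On this reading `chain` could transmit signs but not sustain communication, whereas `loop` contains the kind of cycle the text requires.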


The process of adaptation appears at the level of physical embodiment. It is
formalised as change in a parameter φ_i (for subsystem Σ_i) which selects a particular
mapping f_φi from a set of possibilities F_i. Since the domain of f_φi may be the domain
of F_i or a restriction of this domain, adaptation not only changes the kind of function
that characterises Σ_i but also the momentary couplings between Σ_i and the other
subsystems (within limits defined by the system graph). All subsystems of interest are
adaptive.
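A minimal sketch of adaptation as parameter selection may help fix the idea. It assumes that each mapping is a finite lookup table and that the adaptation parameter (named `phi` here for convenience) is simply a key into a family of such tables; the class `Subsystem`, the signs `x1`, `x2`, `y1`, `y2` and the family contents are all hypothetical.

```python
# A family of possible mappings for one subsystem, keyed by the adaptation
# parameter. Each mapping sends input signs to output signs. (Illustrative data.)
FAMILY = {
    0: {"x1": "y1", "x2": "y2"},   # defined on the whole input domain
    1: {"x1": "y2"},               # a restriction: x2 is no longer accepted
}


class Subsystem:
    """A subsystem whose current mapping is selected by a parameter phi."""

    def __init__(self, family, phi):
        self.family = family
        self.phi = phi

    @property
    def mapping(self):
        """The mapping currently selected from the family."""
        return self.family[self.phi]

    def domain(self):
        """Input signs to which the subsystem is currently coupled."""
        return set(self.mapping)

    def respond(self, x):
        """Output sign for input x, or None if x lies outside the current domain."""
        return self.mapping.get(x)

    def adapt(self, new_phi):
        """Adaptation: change the parameter, and hence the selected mapping."""
        if new_phi not in self.family:
            raise KeyError(f"no mapping for parameter {new_phi!r}")
        self.phi = new_phi
```

Changing the parameter from 0 to 1 alters both the function computed and the set of inputs accepted, which mirrors the remark that adaptation changes momentary couplings as well as the kind of function.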

To pass from the representation of a physical embodiment of a communication 
system to the representation of a communication system, it is necessary to abstract 
special attributes of the physical states which are capable of acting as signs and 
sequences of which constitute utterances. 

We comment that assertions about communication systems and their physical 
embodiment necessarily are assertions of a different logical type. Now in A we need 
to distinguish a pair of communication systems, A and B, mediated by the same 
physical embodiment; hence we need to consider a pair of abstractions, one of which 
induces α signs and the other β signs. Within each of the systems A and B
communication must occur and the ‘signs’ must have sensible characteristics, for example, if
x is an α sign then it is reasonable to demand that x ‘means the same thing’ to all of
the subsystems A that ‘speak’ α and which happen to be similarly adapted; but, in
order to preserve a distinction between α and β, an α sign must not ‘mean the same
thing’ to all B subsystems. Further, from inspection of the directed graph, the calibre
of an utterance x, y is a relation between an individual subsystem and its neighbours
(for X_i, the set of signs that act as an input to Σ_i, is a product set whereas Y_i is a
factor in a product set) and we insisted, in 3.2, that the α and β utterances remain
distinct.

Now it appears impossible to offer an acceptably non trivial representation of the 
phrase ‘means the same thing’ in terms of the information structures A or B alone. 
Perhaps the least image which avoids the criticism that we are talking about a calculus 
rather than a language is ‘x means the same thing to Σ_i and Σ_j if and only if, when
φ_i = φ_j, reception of x by Σ_i gives rise to the same state of Σ_i as reception of x by
Σ_j induces in Σ_j’. The criterion of similarity is an equivalence relation over the states
of the physical embodiment. It has the same perhaps arbitrary origin as the partition 
of states into the states of distinct subsystems and the construction of a directed graph. 

It is not too difficult to constrain the system so that X^α = ∪_i X_i^α, Y^α = ∪_i Y_i^α
are defined (together with X^β and Y^β), and to ensure that U^α ∩ U^β = ∅ providing, of
course, that the underlying structure of the physical embodiment remains invariant.
It is now necessary to enquire what a translation would imply in such a system and 
how it might be represented. 

The particular approach we have chosen is this: within the system as it stands
translation cannot occur, but there is no reason why the α and β utterances of coupled
subsystems Σ_i ∪ Σ_j in A, for example, should not ‘mean the same thing’ as the utterances
of coupled subsystems in B, such as Σ_k ∪ Σ_l, for these utterances have the form
(u, u) rather than u, and if the underlying states (say of Σ_i) are z in Z_i, the states of
a coupled subsystem (say of Σ_i ∪ Σ_j) have the form (z, z) in Z_i ⊗ Z_j. Now to
introduce a strong coupling, as against the existing concept of communication, is completely
arbitrary unless it can be justified apart from our desire for a translation mechanism.
We attempt to justify the innovation on the grounds that whenever an act
of utterance is determined it does not constitute an act of communication, for no
uncertainty regarding the state of the recipient is removed by observing this utterance.
Now it is possible to construct a system in which adaptation leads to precisely
this deterministic condition so that, in the first place, groups of subsystems are impelled
to communicate and then, as a result of communication, become so closely
coupled that they can no longer be regarded as separate entities. The construction
entails the idea of survival of the fabric or physical embodiment of the Σ_i, which depends,
by hypothesis, upon some commodity (represented by an index θ(Σ_i)). If θ(Σ_i)
exceeds some arbitrary value θ_0, Σ_i survives. If not, Σ_i does not survive and the directed
graph of Σ_i is not defined; hence any realistic adaptation is such that it maximises
θ(Σ_i). Let θ be a superadditive measure,


$$\theta(\Sigma_i \cup \Sigma_j) > \theta(\Sigma_i) + \theta(\Sigma_j)$$


and cooperation (or communication) is favoured; the rationale of this being that
outcomes in Z_i ⊗ Z_j which are consistently achieved if and only if the state selections
are jointly determined, yield higher values of θ.
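
The effect of a superadditive index can be illustrated by a small numerical sketch. It is an illustration only and not part of the model: the pairwise `synergy` bonus, the per-member survival rule, the numerical values, and the names `theta`, `is_superadditive`, `survives` and `THETA_0` are all assumptions introduced here.

```python
from itertools import combinations

THETA_0 = 1.0  # hypothetical survival threshold, standing in for the arbitrary value theta_0


def theta(group, base, synergy):
    """Commodity index of a (possibly coupled) group of subsystems.

    Assumed form: the solo values add, and every coupled pair contributes
    an extra `synergy`, which makes the index superadditive when synergy > 0.
    """
    solo = sum(base[name] for name in group)
    pairs = sum(1 for _ in combinations(sorted(group), 2))
    return solo + synergy * pairs


def is_superadditive(a, b, base, synergy):
    """Check theta(a U b) > theta(a) + theta(b) for two disjoint groups."""
    return theta(a | b, base, synergy) > theta(a, base, synergy) + theta(b, base, synergy)


def survives(group, base, synergy):
    """Assumed rule: a group survives if its index exceeds THETA_0 per member."""
    return theta(group, base, synergy) > THETA_0 * len(group)


# Illustrative solo values: neither subsystem reaches the threshold alone.
base = {"S_i": 0.6, "S_j": 0.7}
s_i, s_j = frozenset({"S_i"}), frozenset({"S_j"})
```

With `synergy = 1.0` neither `s_i` nor `s_j` survives alone, whereas the coupled system `s_i | s_j` does; with `synergy = 0.0` the index is merely additive and coupling confers no advantage.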

The proposed mechanism of translation, and by hypothesis of learning, is of this
kind. Given an adaptive system the originally localised subsystems become distributed,
i.e. Σ_i and Σ_j coalesce to form Σ_i ∪ Σ_j. Some α utterances u in use by Σ_i come to
‘mean the same thing’ as α utterances (u, u) in use by the distributed system. A
similar process occurs in B to produce some β utterances u that are identified with β
utterances (u, u), and at this level identification may be (and we assume in a particular
system that it is) possible. In this case, because of the original identities between u
and (u, u), a translation from u ∈ U^α into u ∈ U^β is achieved. The underlying physical
process which, as it were, ‘drives’ the system is evolutionary. On consideration it
is manifest that some subsystems will fail to survive, hence in reality there must be a
replacement or reproductive process. Hence translation involves construction of a
metalanguage (since functions defined upon [Σ_i ∪ Σ_j] ∪ [Σ_k ∪ Σ_l] are functions
of the original functions).

We conclude with a few comments: 

(i) A subsystem is itself a set of utterances.

(ii) To translate these utterances, say from β to α, is a reproductive process.

(iii) Rosen (1958) has argued that self reproduction is self contradictory. ‘Reproduction’
uses a distinguished system such as ‘the environment’ to act as agent in this
process.

(iv) In the present case the ‘environment’, which is also a prerequisite of efficient
reproduction, is represented by the distinct species. Thus Σ_i and Σ_j in A jointly act
as the environment which is an agent in the reproduction of Σ_k in B.


3.5. The mechanism of learning 

Although we have concentrated upon situations in which learning behaviour is 
manifest, the argument applies to any organisation whatever. It applies to organisations
evolving within a man’s brain, for example, or in an artifact; and we expect to obtain much the
same result as we obtain by experimenting with the exteriorised behaviour.

There are several very elegant models in which organisations evolve. Wiener’s mathe- 
matical conception has already been cited; Caianiello’s (1961) model is ideally adapted 
to this field of enquiry; in each case stable modes of oscillation are the entities which 
evolve in the environment of some malleable network, brain or artifact, and which 
we should identify with evolving organisms. Similarly, there are sets of stable modes 
which correspond to the species of evolving organism. 

Instead of reviewing these concepts I shall do the less ambitious job of describing 
a small and in many ways illegitimately simplified model which has been constructed. 
Tested according to our criteria it does behave as a ‘learning mechanism’ and could, 
conceivably, be identified with processes in a real brain. The model arises from the 
proposition that learning is the characteristic behaviour of an evolving species of subsystem
in an evolutionary system. A very similar idea was advanced, but differently
developed, by Pringle (1951). The particular case we have examined is the evolution 
of subsystems that are self organising systems: systems with an always positive rate of 
change of redundancy (Von Foerster, 1960; Beer, 1962). Since this is a model rather 
than an observed assembly, it is constructed in A. 


4. EVOLVING ORGANISMS 


4.1. Evolutionary systems 

Evolutionary systems have been considered in detail in other papers (Pask, 1958,
1961a,b). Briefly, a system Σ is evolutionary if:

(1) the states of Σ are partitioned into states of evolving subsystems Σ_i and of the
environment in which they evolve (called the internal environment to avoid confusion);

(2) there is a competition between the evolving organisms Σ_i for some restricted
commodity (food, energy, variability, specific metabolite) which is necessary for their
survival (we can think of the structural fabric of the organism decaying unless the
process is countered by a synthetic activity and suppose that the organisms compete
for whatever commodity is needed to maintain this activity);

(3) the subsystems Σ_i of Σ reproduce their organisation in order to persist. Hence
any subsystem or any coupled combination of subsystems is an organisation. We
shall not be concerned with specialised reproductive mechanisms like sexual reproduction
and it will do no harm to envisage only one reproductive process, namely
interaction between a structure which is synthesized from raw material by a controlling
process and this controlled activity. Thus activity induces the structure, whilst the
structure implies the organisation which induces the activity needed in order to build
the structure. Hence:


STRUCTURE ⇄ ACTIVITY


(4) reproduction entails replication with occasional variation. Commonly there are
many structural templets L in a set ℒ that will induce the same activity R or an
activity that is equivalent in the sense that it belongs to a set ℛ such that all R ∈ ℛ
induce some L ∈ ℒ. Hence the reproductive mechanism is stabilised by a redundancy
of structure and function which is represented by a many to many mapping from
ℒ to ℛ and ℛ to ℒ. In other words, at any moment t there will be one particular
structure L_0 ∈ ℒ which may induce any R ∈ ℛ at t + 1. Suppose R_0 is induced at
t + 1; then any L ∈ ℒ may be induced at t + 2.


STRUCTURE ⇄ ACTIVITY


Thus giving on average:


Fig. 2.


The organism that is reproduced is the mapping ℒ ⇄ ℛ* and the elements of
these sets. It is convenient to denote it G and to write L(G) ⊂ ℒ as any structural
component of G and R(G) ⊂ ℛ as any dynamic component of G.

However, neither ℒ nor ℛ is invariant. To be precise, we should have written
ℒ(θ) and ℛ(θ) where θ is a function of the commodity needed for survival of a
structure or the maintenance of activity. Thus for θ = 1 and θ = 2, we might have:

Variants arise due to changes in θ (changing ℒ(θ) and ℛ(θ)) or due to aberrant
transformation like:


* To simulate a viable form of G we choose some characteristic y, such as a distribution of activity
over the entire system, that depends upon the existence of ℒ and ℛ and control θ so that survival
of an organisation is conditional upon its satisfying y.

or (in a way we shall consider in a moment) due to interaction between a pair of
reproductive mechanisms. The important point is that variation is the only randomness
in the system and it is unrelated to the randomness of dice throwing or haphazard
connectivity that appears in a number of adaptive models. The many to many form
of ℒ → ℛ and ℛ → ℒ has nothing to do with ‘chance’ although it may lead an
observer to adopt a ‘probabilistic’ image of the system.

(5) there are constraints in the internal environment that favour the development 
of specific variants, in this case, of any self organising system; 

(6) amongst these are constraints that favour correlated activity amongst a set of 
organisms realised by a communication between the organisms in this set, essentially, 
Bonner’s (1958) condition for multicellular organisms, and organised populations; 

(7) there is a relation of identity between certain organisms defining species of 
organisms; 

(8) the organisms are ‘built to survive’, that is, they are homeostatic and ‘ultrastable’
(Ashby, 1956) (they adapt to maintain the possibility of homeostasis in a changing
internal environment by maximising θ).
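
Conditions (3) and (4) can be sketched as a toy program. The templet and activity names, the two mapping tables and the `round_robin` selection rule are hypothetical and chosen only for illustration; the selection rule is deliberately deterministic, to echo the remark that the many to many mapping has nothing to do with dice throwing.

```python
# Hypothetical many to many tables: which activities a structure may induce,
# and which structures an activity may induce (assumed values, not from the paper).
L_TO_R = {"L0": {"R0", "R1"}, "L1": {"R1"}, "L2": {"R0"}}
R_TO_L = {"R0": {"L0", "L2"}, "R1": {"L0", "L1"}}


def round_robin(options, t):
    """Deterministic stand-in for whatever unmodelled detail selects the variant."""
    return options[t % len(options)]


def reproduce(structure, cycles, choose=round_robin):
    """Alternate STRUCTURE -> ACTIVITY -> STRUCTURE for a number of cycles."""
    history = [structure]
    for t in range(cycles):
        activity = choose(sorted(L_TO_R[structure]), 2 * t)
        structure = choose(sorted(R_TO_L[activity]), 2 * t + 1)
        history += [activity, structure]
    return history


def organisation_persists(history):
    """The organism G persists while structures stay in L and activities in R."""
    structures, activities = history[0::2], history[1::2]
    return all(s in L_TO_R for s in structures) and all(a in R_TO_L for a in activities)
```

The point of the sketch is that what is reproduced is the mapping between the two sets, not any single templet: `reproduce("L0", 2)` passes through a different structure from the one it started with, yet `organisation_persists` still holds.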


4.2. Invariance of an organisation 

Identifying a brain with a system such as Σ leads to a picture in which the computation
performed by the brain is incidental to the basic issue of maintaining the fabric.
This is a very reasonable picture for the salient feature of a brain (or any other living 
assembly) is that unlike a computing machine, it cannot be ‘turned off’. If it were 
‘turned off’ by accident, the physical structure would decay and without sufficient 
metabolic activity it would decay in a less dramatic fashion. The metabolism is simply 
the energetic side of a reproductive process which preserves organisms within the brain 
(regarded as the ‘internal’ environment in which these organisms evolve) or preserves 
the organisation ‘brain’ itself. 


4.3. The origin of the indeterminacy associated with learning 

Within this framework the apparent indeterminacy of a learning mechanism has a 
plausible origin. Assume that the signals that are supposed to represent information
in an operational language, say α, are identified in A with the changes of state constituting
the activity component in a reproductive process:


STRUCTURE ⇄ ACTIVITY


The integrity of an informational image depends upon maintaining a distinction between
signals and their transformation (or the existence of a transfer function for the
entire network or a component, such as a neurone). But the transfer function is 
determined by the structure. However, the structure is maintained by a reproductive 
process which entails the flux of activity at least some of which we are anxious to call 
signals. Hence signals and structure cannot be isolated. Their indices are conjugate 
variables. Further, the mapping ACTIVITY → STRUCTURE and the mapping STRUCTURE
→ ACTIVITY are many to many. Thus we could obtain a partial but consistent form of
description of an organism G in Ay in terms of a probability distribution over the set 
of structural components of G, but to describe the form of organisation completely 
it would be necessary to have access to a A description as in 3.2. or in the Appendix. 


4.4. Identification of many evolutionary systems in a brain 

Single evolutionary systems, for which it is not difficult to specify a complete description,
can be programmed on a computer (MacKay, 1958; Pask, 1959). But a
brain is a physical assembly which is identified with a vast, perhaps an indefinite,
number of different evolutionary systems. Each one (or, in some cases, each subset of
these) can be identified with an operational language, α, β, ... (in a sense we shall illustrate
in a moment). But the evolutionary process in the assembly does not consist of the
separate evolution of these systems; on the contrary, interaction occurs between the
systems. When the interaction occurs within α or within β it has the calibre of
information flow, but when it occurs between α and β or β and α, it has the calibre
of an identification, which, according to the argument of this paper, is the process
which distinguishes learning from adaptation.


5. PHYSICAL MECHANISMS 


5.1. Particular model

The model we shall discuss has been realised as an artifact (Pask and Lewis, 1962). 
The logical constituents are a pair of variably coupled evolutionary systems having a
common internal environment and identified with operational languages α and β such
that α signs are one attribute and β signs another attribute of the same physical event
(a unit impulse or an action potential). We assume that an observer is impelled by
convention, or sheer conceptual necessity, to identify the internal environment of the 
system with a rigid framework (which we take to be a network of real or artificial 
neurones), and is anxious to assign definite locations to the physical events and to 
regard one or another abstraction of events as a signal. One result of this descriptive 
method is an indeterminacy, which manifestly has a conceptual rather than a factual 
origin. 

For convenience, I shall tentatively identify parts of the system with parts of a 
brain, using correspondences that stem from a couple of physiological hypotheses. 
(In some respects, the working of the system plausibly fits what goes on in a brain.) 
But the main purpose of identification is to yield a vivid picture. Indeed, (1) the
system could equally well have been identified at a sociological or a psychological
level; (2) proof or disproof of the physiological hypotheses would have little effect
upon the system, which is a cybernetic model, with a physical cogency that stems
from the behaviour of an isomorphic artifact.


5.2. Internal environment

We assume the conditions of 4.2., some of them in a specialised form. The internal
environment is a network of malleable (or adaptive) neurones denoted a. They may
either be real or artificial, but in the latter case their design must embody certain
elaborate characteristics of real neurones that will appear in this discussion. Full connectivity
would be admissible but impracticable. It is, however, important that the
neurones be in cyclically fibre connected groups denoted A, wherein neurones compete
for a commodity indexed by θ. In a real brain the θ competition may entail a mutual
inhibitory mechanism which does involve fibres or it may depend upon diffusion of
a metabolite, or it may be due to the transfer of metabolites from the glial cells
surrounding the cortical neurones.

There are a pair of energetic constraints: 

(i) Any region of the network is constrained to manifest a minimum activity, and
is physically restricted to a maximum activity dependent upon θ (this condition
ensures that neuronal reproductive mechanisms are ‘driven’ by undifferentiated
energy).

(ii) Any neurone is required to maintain a minimum metabolic level and is limited
to a maximum level dependent upon θ (this condition ensures that intracellular reproductive
mechanisms are ‘driven’ by undifferentiated energy).

Recall that θ is an index of the rate at which structures can be synthesized and
maintained. We shall be concerned on the one hand with relatively macroscopic
structures (plastic changes at synaptic connections), and on the other with microscopic
structures (molecular synthesis). Hence the physical restrictions indexed by θ
differ greatly from one case to another.

Again we posed in 4.2. the objective of survival and the possibility of decay. There 
is no question of neurones decaying as such. But there is plenty of evidence that 
neurones may be more or less reversibly oblivescent, that ‘plastic’ changes can literally 
decay, and that a molecular species must be maintained by specific synthesis. 

The physical events appearing in the internal environment are impulse patterns of
which the spatial attributes are α signs and the temporal attributes, β signs.


5.3. Memory systems, learning mechanisms, and the evolving organisms

First, let us consider the evolutionary system over an interval so brief that it has
little chance to evolve. In this relatively static picture it will be possible to study the
relation between an α system and a β system and the processes of information flow,
computation and identification.

Within the static picture evolving organisms which are later to be identified with
learning mechanisms appear in a degenerate form as ‘memory mechanisms’ or systems
that reproduce organisations called ‘a memory’. Each organism is a subsystem of one
or another evolutionary system, the α or the β system (we do not suggest that a real
brain has only two systems, probably it has an indefinite number; but two is the least
number required to exhibit these principles).


5.4. The α system

We assign an operational language α to an evolutionary system which is similar to
the resonance structure considered by Pringle (1951). This assignment must, of course,
be justified later by examining the relation between this and other systems to ascertain
that it does define the relation between a language α and other languages.

The evolving organisms are reproduced by interaction between a spatially distributed
pattern of activity in the oscillatory neurone network and a structural templet
consisting of plastic modifications of the synapses of some of the neurones a in a
group A, for example, the neurones a_1, a_2, ..., a_n in group A_1. Recalling
the driving condition of 5.2, the activity amongst the neurones of A_1 will depend, in
the absence of impulses received from another group, A_2, upon the structural templet
of A_1, and any pattern that persists depends upon a dynamic equilibrium achieved by
the reproductive mechanism:


Impulse pattern ⇄ Plastic changes


The organism which is being reproduced is a relation between impulse pattern and
plastic change. It is denoted G_i^α, the superscript for the operational language and the
subscript for its identity.

Equivalently, G_i^α = ℒ_i^α ⇄ ℛ_i^α. Hence G_i^α has dynamic components R(G_i^α) ⊂ ℛ_i^α,
which are not localised in the network, and structural components L(G_i^α) ⊂ ℒ_i^α, which
are localised within the region A_1 of this network.

Neurones in A_1 are linked by parallel fibres to other regions like the group A_2
(with its own neurones a), and since any dynamic component R(G_i^α) is a pattern of impulses
it may be transmitted by this parallel fibre arrangement to A_2.

Now depending upon the state of A_2 (namely which G are being reproduced at A_2)
these impulses may induce some L(G_i^α) and later G_i^α at A_2, because of which we call R(G_i^α)
an α sign for G_i^α (it is neither necessarily nor commonly a unique sign for G_i^α). Hence
organisms in α may be reproduced at a given location or transmitted through their
dynamic component to induce a remote reproductive mechanism for the same organism.
α signs are the dynamic components of organisms and are the spatial pattern
attribute of certain physical events.

Thus, as a consequence of describing the internal environment in A as a network
and distinguishing an α system:

(i) All α organisms G^α are associated with a reproductive mechanism which, at any
instant, has a definite location upon some group, A_1, of neurones.

(ii) α signs are dynamic components R(G_i^α).

(iii) The set of α signs is determined by the spatial pattern attribute of physical
impulse sequences.

(iv) Only α signs have ‘meaning’ (well defined operational effect) in α. They are
transmitted by α couplings of appropriate (parallel fibre) capacity.


5.5. The β system

We shall assign an operational language β to an evolutionary system proposed as
a memory mechanism by Hyden (1960) as a result of his experiments upon the changes 
in RNA concentration in stimulated neurones. Similar mechanisms have been sug- 
gested by Polonsky (1961) and Barbizet and Albarde (1961). We assume that the 
frequency (or a comparable attribute) of a temporal sequence of impulses is able to 
modify some of the RNA molecules produced by a neurone so that a correspondingly 
modified protein is synthesized. We further suppose that this modified protein (which 
will only be available in neurones that have previously received a stimulus at this 
frequency) is specifically sensitive to this stimulation frequency and dissociates, if the 
frequency occurs upon a subsequent occasion, to yield acetylcholine or some other 
activator. Hence certain neurones embody ‘specific receptors’, denoted b, for specific
frequencies of impulse sequences and, since the frequency is represented by a modification
in RNA synthesis, a neurone a may include vast numbers of ‘specific’ receptors,
b(1), b(2), ..., b(m) for different frequencies. Hence β signs are frequencies or
temporal attributes of an impulse sequence; in particular, a β sign is that frequency
able to induce a specific receptor b. An organism G^β in B is maintained by a reproductive
mechanism which, recalling the ‘driving’ condition of 5.2., is represented by

the double cycle:

[Diagram: a double cycle linking the RNA template, amino acids, the specific protein, the impulse frequency, and the impulse activator.]

which might be formulated:

[Formula not recoverable from the source.]

which can be reduced to:

G^β = ℒ^β ⇄ ℛ^β

if we keep in mind the fact that the structural components L(G) are automatically
perpetuated (due to the persistence of the RNA modification) and consequently are
likely to be distributed (any neurone having a specific receptor will retain it with a
minimum of oscillatory impulse reinforcement). Hence the location of the structural
component of G in the network at any instant is likely to be a set of specific receptors,


B_i = [b_1(i), b_2(i), ..., b_m(i); b_1(j), b_2(j), ..., b_m(j)]


which will be very large.

Thus, as a consequence of describing the internal environment in A as a network
and distinguishing a β system:

(i) All β organisms G^β are associated with a reproductive mechanism which is
distributed (since its structural and its dynamic components are distributed), being
located only in a set B with specific receptors b in most neurones.

(ii) β signs are dynamic components R(G_i^β).

(iii) The set of β signs is determined by the frequency attribute of all physical impulse
sequences.

(iv) Only β signs have ‘meaning’ (well defined operational effect) in β. They are
transmitted along any β coupling, which is any fibre of appropriate (frequency limiting)
capacity.


5.6. Identification between α and β systems

The physical events, impulse sequences distinct in A, have an α and a β aspect.
Further, although an α sign has no ‘meaning’ in β, any sequence of α signs implies a
frequency of impulses at some point in the network which may be (and by a suitable
choice of parameters, almost certainly is) a β sign. Thus, the flux of α signs generates
β signs but the β sign (or frequency) generated by a given α activity will normally
differ at different points in the network. Conversely, many different modes of α activity
will induce the same β sign at a given point.
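
A minimal sketch may make the location dependence concrete. It assumes, purely for illustration, that an α sign is a tuple of 0/1 impulses over three network points and that the β sign at a point is the fraction of steps on which that point fires; the function name `beta_sign` and both example sequences are hypothetical.

```python
from fractions import Fraction


def beta_sign(alpha_sequence, point):
    """Impulse frequency seen at one network point over a sequence of alpha signs.

    Assumption: each alpha sign is a tuple of 0/1 impulses, one entry per point,
    and the beta sign is the firing fraction at the chosen point.
    """
    fired = sum(pattern[point] for pattern in alpha_sequence)
    return Fraction(fired, len(alpha_sequence))


# Two different modes of alpha activity (illustrative values only).
seq_a = [(1, 0, 1), (1, 1, 0), (1, 0, 0), (1, 1, 0)]
seq_b = [(0, 1, 1), (1, 0, 1), (1, 1, 0), (0, 0, 0)]

# One alpha activity, three different beta signs, depending on the point observed.
signs_of_a = [beta_sign(seq_a, p) for p in range(3)]
```

Here `seq_a` yields a different frequency at each of the three points, while `seq_a` and `seq_b`, although different as spatial patterns, yield the same frequency at the middle point. This is the many to many, position dependent relation described in the text.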

The signs have lost the property of being signals and bear a closer relation to Greene’s 
percepts. 

We have argued that a peculiar, location dependent, many to many mapping exists
from α to β. Conversely, since any set of frequencies induces a spatial distribution,
there is a comparable relation between β and α.

The immediately important point is that our assignment of operational languages,
α, β, to a pair of evolutionary systems appears to be sufficiently justified because the
relation between α and β is exactly the many to many, A indeterminate, and organism
dependent relationship we discussed in 3.3. The doubtful part of the argument is the
property of organism dependence; to exhibit it notice that the frequency induced (as
a β sign) by a sequence of α signs depends upon the position in the network at which
the β sign is specified; similarly the impulse pattern induced (as an α sign) by a collection
of β signs depends upon the set of points specified as recipients. Now the relation
between operational languages which, we assert, depends upon the organism is ‘translation’
or ‘identification of sign sequences’ and since any α organism has a localised
structural component, position dependence secures this property for any β → α translation.
However, a β organism has a distributed structural component. Hence the
property will apply for α → β translation excepting, perhaps, for the case of a completely
distributed β organism.

Identification appears in our system as a process whereby a set of β signs comes to
‘mean the same thing to’ a certain α organism, say G_i^α in a region A_1, as a given α
sign, say R(G_j^α), which is one dynamic component of an α organism G_j^α in a region A_2.
The phrase ‘meaning the same thing to’ G_i^α implies that in a A description the same
change of state is induced by a given sequence of β signs as was previously induced
or is coincidently induced by the α sign R(G_j^α). In other words, a transformation like:

[Diagram of the transformation not recoverable from the source.]

is induced either by an α sign or a set of previously incomparable β signs. The same
comments apply to the β identification of an α sign.


6. THE EVOLUTION OF ORGANISMS 


6.1. Organisms as decision makers 

In the relatively static picture we have considered so far a group of neurones, say
A_1, acts as a decision making system which, at a particular instant, resembles a finite
automaton with a transfer function, determined by G_i^α, and realised as an object in
the sense of 3.2 and 3.3, say:

$$F_1^{\alpha}(X_1^{\alpha} \otimes Y_1^{\alpha}) \rightarrow Y_1^{\alpha}$$

where X_1^α is the set of α signs distinguishable at the incoming fibres of A_1 and Y_1^α is
the set of α signs distinguishable at the A_1 outgoing fibres.

But it is important to recall that the substance of the organism is the reproductive
process for G_i^α and not the particular neurones that happen to mediate this organisation.
The organism sits at A_1 and uses the A_1 neurones as its body, perhaps only
for a moment, perhaps indefinitely. The grouping condition of 5.2. secured a number
of positions in the network of neurones where organisms could sit comfortably. Thus
a feasible A structure, if → implies a fibre connection, ....... a θ dependence,
and ○ a neurone, would be

Fig. 6.

and any α organism is an organisation involving loops like a_1 ⇄ a_2 or a_1 → a_2 → a_3 → a_1
and plastic changes at the synapses of these neurones.
However, the A_1 neurones also mediate parts of β organisms since each neurone is
likely to embody several specific receptors, b. We could, though with less practical
point, define a distributed β automaton with a transfer function


$$F^{\beta}(X^{\beta} \otimes Y^{\beta}) \rightarrow Y^{\beta}$$


where X^β is the set of β signs appearing in all b ∈ B and Y^β is the set of β signs
produced by the neurones a that embody these specific receptors.

From the engineering point of view any neurone can be regarded as a dual purpose
mechanism which includes a number of functional parts (Fig. 7).
The incoming fibres provide impulses which are filtered for their spatial component
by an α filter (the θ input to this box is a plausible innovation) and give rise to an α
decision (plausibly a state change that exerts some θ effect upon the neighbouring α
filters). The incoming impulses are also filtered by a β filter for their frequency component
and give rise to a β decision involving a specific receptor which is shown as
a pair of coupled boxes. The neurone produces an output impulse which may be part
of either an α sign or a β sign and the decision to emit this impulse depends upon the
weight V_α, V_β, given to the α, β, decisions.
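
The dual purpose mechanism can be sketched as a small data structure. This is a hedged illustration and not the circuit of the artifact: the exact-match α filter, the tolerance band of the β filter, the linear weighting and the firing `threshold` are all assumptions made here, as are the names `Neurone`, `alpha_decision`, `beta_decision` and `fires`.

```python
from dataclasses import dataclass


@dataclass
class Neurone:
    alpha_template: tuple   # spatial pattern the alpha filter responds to (assumed)
    receptors: tuple        # frequencies of the specific receptors b (assumed)
    v_alpha: float          # weight V_alpha given to the alpha decision
    v_beta: float           # weight V_beta given to the beta decision
    threshold: float = 0.5  # hypothetical firing threshold

    def alpha_decision(self, pattern):
        """1.0 if the incoming spatial pattern matches the templet, else 0.0."""
        return 1.0 if tuple(pattern) == self.alpha_template else 0.0

    def beta_decision(self, frequency, tolerance=0.05):
        """1.0 if the incoming frequency lies near some specific receptor, else 0.0."""
        near = any(abs(frequency - b) <= tolerance for b in self.receptors)
        return 1.0 if near else 0.0

    def fires(self, pattern, frequency):
        """Emit an impulse when the weighted alpha and beta decisions reach threshold."""
        drive = (self.v_alpha * self.alpha_decision(pattern)
                 + self.v_beta * self.beta_decision(frequency))
        return drive >= self.threshold


# The same neurone body, weighted first towards alpha and then towards beta.
mostly_alpha = Neurone((1, 0, 1), (0.25,), v_alpha=0.8, v_beta=0.2)
mostly_beta = Neurone((1, 0, 1), (0.25,), v_alpha=0.2, v_beta=0.8)
```

With the weights set towards α the neurone responds to the matching spatial pattern and ignores a matching frequency; with the weights reversed the opposite holds. Varying V_α and V_β is thus enough to move the same neurone between the two organisations.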


6.2. Interaction with evolving organisms 

Hence, any collection of neurones is potentially the location of an α or a β organism,
if it takes part in an α reproducing or a β reproducing process. The common element in
these processes is an impulse which is the physical event we distinguish as a sign in 4d. 

Suppose we have evidence to suggest that a certain group of neurones is the location
of an α organism; we can interact with it in the operational language α by introducing
stimulus and recording electrodes and correlating the spatial or α sign characteristic
of its input, with any spatially patterned stimuli we introduce (Fig. 8).

At any moment any neurone may be more or less a part of an α or a β organisation.
Its status in this respect is chiefly a function of V_α and V_β which, however, we must
assume to be variable. Now we have already asserted that a neurone is, in a restricted
sense, a θ maximising device and it is no innovation to remark that if the neat boxes
did exist (in real neurones, at any rate, they do not), then one of the θ maximising
adaptations would be a variation of V_α and V_β.

Hence, suppose we α interact with the neurone group at region A_1 using such a
correlation function in our external feedback loop that θ is maximised (and if we do
not our interaction cannot persist); then the A_1 neurones will be induced to increase V_α
and behave as part of an α organism. A similar comment applies in the case of a
stable, θ maximising, β interaction when the A_1 neurones will be induced to behave
as part of a β organism.

This is the dilemma of any conversational method for observing a system. But unless
such a definitely α interaction or definitely β interaction is stabilised, the organisms in
the system will have α and β aspects. To some extent, we can regain determinacy and
the chance to know, for certain, the implication of a sign, by talking, as we have done,
about an α organism identifying the dynamic component of a β organism as a β sign,
or vice versa, but this expedient has doubtful value whenever it becomes necessary to
admit structural α and β interactions. In fact, the commonest organisms are linguistic
hybrids, which is why the system is difficult to observe in any other than a conversational
way, that is, by changing from α to β as necessary, in order to maintain the
discourse.


6.3. A self organising system 

In 5.2 we defined the internal environment as embodying a set of constraints which
would favour the evolution of an organism if it was a self organising system; in other
words, only a self organising system would receive an adequate amount of θ.

Consider an α decision maker in the sense of our previous discussion. It will be a
self organising system if and only if the rate of change of the redundancy of its behaviour
is positive; namely, if and only if Δ redundancy (x, y)/Δt > 0.

In a variable internal environment a θ maximising adaptive system can behave as
a self organising system in this manner; adaptation will lead to a transfer function
which maximises θ, but before this occurs, the environment will change, requiring a
different kind of adaptive modification. Thus, in a certain sense, the system is always
adapting; and the redundancy of the behavioural outcomes is always, on average,
increasing.

This analysis is tempting but spurious because it relies upon redefining the set X
and hence redefining the ‘system’ (Beer, 1961). For a given set X (namely in an invariant
or stationary environment) the system (with behavioural outcomes X ⊗ Y) will adapt
to maximise θ. The redundancy will increase for a while but will eventually become
constant. At this point the system will no longer behave as a self organising system
and will no longer be favoured with an adequate value of θ. Since the physical structure
of the organism depends upon θ it must become a self organising system in order to
survive.
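
The criterion Δ redundancy (x, y)/Δt > 0 can be illustrated with a minimal sketch. It assumes, as an illustration only, that redundancy is measured as 1 − H/H_max over a sample of behavioural outcomes (x, y); the function names and the sample windows are hypothetical and are not taken from the text.

```python
import math
from collections import Counter

def redundancy(outcomes, n_possible):
    """Redundancy 1 - H/H_max of a sample of behavioural outcomes (x, y)."""
    counts = Counter(outcomes)
    total = sum(counts.values())
    entropy = -sum((c / total) * math.log2(c / total) for c in counts.values())
    return 1.0 - entropy / math.log2(n_possible)

def is_self_organising(windows, n_possible):
    """True if redundancy rises strictly from each time window to the next."""
    values = [redundancy(w, n_possible) for w in windows]
    return all(later > earlier for earlier, later in zip(values, values[1:]))

# Hypothetical data: three successive windows over 4 possible (x, y) pairs.
windows = [
    [("a", 0), ("a", 1), ("b", 0), ("b", 1)],  # uniform behaviour
    [("a", 0), ("a", 0), ("b", 0), ("b", 1)],  # partly constrained behaviour
    [("a", 0), ("a", 0), ("a", 0), ("a", 0)],  # fully determined behaviour
]
print(is_self_organising(windows, 4))
```

Once the behaviour is fully determined the redundancy can rise no further, so a repeated final window fails the test; this is the stationary case described in the paragraph above.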
6.4. The mechanism of evolution

To become a self organising system, the domain of the transfer function, which is
a set X_1 ⊗ Y_1 of behavioural outcomes, must be transformed: (X_1 ⊗ Y_1) → (X_2 ⊗ Y_2),
or (X_1 ⊗ Y_1) → T(X_1 ⊗ Y_1), where T is defined as a transformation that maintains


    Δ redundancy (x, y)/Δt > 0

or equivalently, writing Σ and T(Σ), that maintains

    Δ redundancy (Σ)/Δt > 0.


Hence, although no system Σ with an invariant domain can be a stable self organising
system, the disjunction Σ*, where


    Σ* = ⋁_{r=1}^{R} [T^r(Σ)],   where r = 1, 2, ..., R,


is a stable self organising system if it exists.

We call Σ* the evolution of Σ and comment that its existence is guaranteed by the
physical constraints in networks such as Caianiello’s (1961). But the evolution can be
either of two kinds.

(1) The set of behavioural outcome states is transformed or evolves due to a correlation
between the activity of a larger set of neurones. In this case the α organism
expands to occupy a larger volume in the network. The mechanism of evolution is
transmission of α signs (say from A_1 to a neurone group harbouring a similar activity,
like A_2). The result is resonance between the organisms located at A_1 and A_2 to yield
a system with outcome states (X_1^α ∪ X_2^α) ⊗ (Y_1^α ∪ Y_2^α). Let us call this regional
evolution.

(2) The set of behavioural outcome states is transformed by the inclusion of β states
whereby the organism becomes equivalently capable of identifying β signs, or an α, β
hybrid, or, in the limiting case, a definitely β organism. The mechanism is a change in
the weights V_α and V_β (an increase in V_β/(V_α + V_β)) of neurones, only some of which will
ordinarily be in the group A. The resulting organism will have a set
(X_1^α ∪ Ω(X_2^β)) ⊗ (Y_1^α ∪ Ω(Y_2^β)) of behavioural outcomes, but (since X_1 and X_2, and
Y_1 and Y_2, are probably connected with entirely different neurones) this assertion has
little practical importance. The process, however, is important and we shall call it
linguistic evolution.


6.5. The necessity for evolution 

It is not difficult to show that if the evolutionary system is left alone, the organisms 
will evolve in the regional sense of (1) and by the linguistic evolution of (2). The 
argument will not be given in this paper but we comment that the result is a con- 
sequence of the general principle that one evolving organism will exist in an internal 
environment increasingly determined by the other organisms of its own species. 


6.6. Learning related to evolution 
Later, in 8.1. and 8.2., we shall contend that the regional or the linguistic evolution
of an organism has the characteristic required for mechanisms that mediate various
kinds of learning behaviour. Before developing the point it will be worth while
considering a related case that illustrates the enormous generality of this process.


7. DIGRESSION INTO PHYSICAL ASSEMBLIES


7.1. Solving an undecidable problem 

Consider any automaton able to receive evidence from a set of possibilities U and
to select actions from a further set of possibilities C. Let the automaton depend
upon θ for physical survival, and let it be programmed to maximise the θ available
in its environment with a transfer function f: (U ⊗ C) → C, which is often called a
decision rule. Suppose that this automaton is presented with an undecidable problem;
in other words, given certain evidence u_0 ∈ U and having selected c_0 ∈ C, the decision
rule specifies no change of state which will increase θ, yet u_0 remains invariant and, in
(u_0, c_0), the value of θ is decreasing. Without further specification the automaton is
unable to survive. It must remain in (u_0, c_0) until it decays.

The organisms we have considered, are momentarily like the automaton. Their 
strategy for avoiding an undecidable and destructive situation is to evolve into another 
organism which may be able to solve the problem. 

The present automaton could also be designed with this strategy, in which case its
response to an undecidable and destructive situation would be U → U*, U ⊂ U* (to
obtain novel evidence on the basis of which to decide) or C → C*, C ⊂ C* (to select
different actions in an attempt to elicit further evidence). The fact that most automata
are not designed like this is due to the form of evidence commonly presented, namely 
the values of one or a limited number of continuous variables. The set of possibilities 
must still be increased in order to solve the problem but when the environment has 
this very special structure it is possible to invoke the doctrine of chance events, to 
throw a dice and select amongst the existing set of possibilities by chance; for these 
possibilities define the environment completely. 
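
The response C → C*, C ⊂ C* can be sketched in a few lines. This is a toy illustration under stated assumptions: the decision rule is represented as a lookup table, the decay of θ is not modelled, and every name (step, is_undecidable, evolve_actions, the states and actions) is hypothetical.

```python
def step(rule, state):
    """Apply a decision rule f: (U x C) -> C; the evidence u stays as presented."""
    u, c = state
    return (u, rule.get((u, c), c))

def is_undecidable(rule, state):
    """The automaton is stuck when the rule prescribes no change of state."""
    return step(rule, state) == state

def evolve_actions(rule, actions, state, new_action):
    """C -> C*: enlarge the action set and let the stuck state select the new action."""
    extended_actions = set(actions) | {new_action}
    extended_rule = dict(rule)
    extended_rule[state] = new_action
    return extended_rule, extended_actions

# Hypothetical automaton: no rule entry exists for (u0, c0), so it cannot move.
C = {"c0", "c1"}
rule = {("u1", "c0"): "c1"}
state = ("u0", "c0")

print(is_undecidable(rule, state))        # stuck in (u0, c0)
rule_star, C_star = evolve_actions(rule, C, state, "c2")
print(is_undecidable(rule_star, state))   # no longer stuck
```

The enlargement U → U* could be sketched in the same way by extending the evidence set rather than the action set.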


7.2. Decision in unstable equilibrium 

Let us now reduce the choice situation to its most rudimentary form, namely a ball, 
balanced on a pin, and able to fall in various directions. Suppose the point of the pin 
is progressively narrowed so that the equilibrium becomes increasingly unstable. 

Now this rudimentary situation is ‘undecidable’ in the sense that at any moment 
when the ball rests on the pin there is nothing to ‘predispose’ it to one direction of 
falling rather than to another; otherwise it would have fallen. As the pin is narrowed, 
the ball must eventually topple over, when we assume that it is replaced and that the 
process is repeated. 

The event of toppling over can be described in two different ways: 

(1) As an event determined by some physical variable or combination of variables 
such as a breath of wind or a magnetic field. As the pin narrows an increasing number 
of these variables exert sufficient effect upon the ball to determine its descent. Hence 
narrowing the pin has the effect of increasing the set of significant variables, that is, 
of variables which significantly determine the outcome and of which there are an
indefinitely large number.

(2) As a matter of chance, for we are ignorant of, or indifferent to the determining 
variables. 

Ordinarily, either (1) or (2) is acceptable. But obviously it would be essential to 
use (1) and not (2) if the ball could somehow profit from falling in one direction, say 
X and if being pushed by the breeze determined its descent into Y when the pin was 
less narrow and the system more stable, and if, as a result, the ball adapted to become 
more sensitive to the breeze than to the other variables. This adaptive process is 
distinct from adaptations that select a ‘value of a variable’, such as ‘the strength of 
breeze’, which may also occur. 
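
Description (1) can be made concrete with a small sketch. It assumes, purely for illustration, that narrowing the pin lowers a single threshold that a variable's effect must reach to topple the ball; the variable names and effect sizes are hypothetical.

```python
def significant_variables(perturbations, threshold):
    """Names of variables whose effect is at least the toppling threshold."""
    return sorted(name for name, effect in perturbations.items() if effect >= threshold)

# Hypothetical effect sizes (arbitrary units) of variables acting on the ball.
perturbations = {
    "breeze": 0.9,
    "vibration": 0.4,
    "magnetic field": 0.1,
    "thermal noise": 0.01,
}

# Narrowing the pin is modelled as lowering the threshold needed to topple the ball.
for threshold in (1.0, 0.5, 0.05, 0.005):
    print(threshold, significant_variables(perturbations, threshold))
```

As the threshold falls, the list of significant variables grows; the adaptation discussed above would correspond to the ball changing which of these variables it is sensitive to, not to selecting a value of any one of them.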

A comparable selection from the states of an indefinitely large number of possible
variables takes place in biological evolution to determine, for example, the sense
modalities of organisms. Trivially, in the evolutionary system we have considered,
or non trivially in the maturation of a biological brain, the selection is made, instead,
from forms described in an indefinitely large number of possible languages.

We are now able to relate entities in the evolutionary system to learning mechanisms 
in a very general way indeed. Having demonstrated the diversity of systems capable 
of embodying these principles, we need no longer restrict our attention to a pair of 
operational languages. The evolutionary system, when realised in a physical assembly, 
has a special and important ‘analogue’ property (MacKay has pointed out a rather 
similar property in other systems). Conventionally we are accustomed to analogue 
devices that select from an indefinite number of values of a variable (the device 
works with continuous variables). The physically realised evolutionary system selects 
from an indefinite number of variables or an indefinite number of operational lan- 
guages. 


8. DIFFERENT KINDS OF LEARNING* 


8.1. Forms of learning mechanism 

The limiting case of a learning mechanism is any system with adaptive properties. 

These are a prerequisite of learning or evolution. Consider discourse in α with the
evolutionary system we have discussed and suppose that the organism or set of
organisms involved in the feedback path of the α signs of the discourse is in a condition
where, equivalently: (1) it must evolve in order to survive; (2) it is no longer a stable
self organising system; (3) it is presented with an undecidable problem; (4) it is unstable,
like the ball on a pin.

Then evolution occurs, which will be manifest in the α discourse as a learning
behaviour.

If regional evolution takes place the possible behaviours are stimulus or response
generalisation or the development of a body of related experience; knowledge could
be accumulated in terms of different orders of conditional probabilities of events or
existing constructs; but no organisation other than the organisations predicated in the
α system can occur and hence no essentially novel concept or relation can arise (for
even variation in the system, which may be interpreted as chance trial learning, induces
organisms predicated in the α system). Briefly, then, the process of regional evolution
implies a mechanism that can account for inductive learning (and, by extending the
model to include its response selection, inductive inference in the sense of Braithwaite
and Popper (1962)). It may also account for change of attention within a field made
up of similarly ordered objects. The apparent indeterminacy entailed in the learning
process is due to the intrusion of β signs (or the hybrid character of the evolving
organisms) that (in many systems) is also the chief progenitor of variation.


* See also the Appendix.

If linguistic evolution occurs the resulting behaviour can manifest originality, providing
that the system is conceivable to whoever indulges in the discourse. α signs
are identified with a set of β signs and vice versa. The evolutionary level at which
identification of α and β signs is deemed to take place depends upon his discrimination
and is not otherwise demarked from the hybrid interaction which is discounted as
irrelevant. So, to use a rather inadequate analogy, some α signs which appear ‘hazily
valued’ in the discourse will, at some evolutionary level, assume ‘sharp values’ as α
signs identified with sets of β signs.

The mechanism of linguistic evolution can give rise to novel concepts and relations,
for example, the relations involved in constructing the conceptual hierarchy of a
hypothetico-deductive structure in the sense of Braithwaite and Popper. It can account
for attention to previously incomparable objects or entities. Finally, insight is one
behavioural consequence of such a system. The flux of α discourse is likely to be,
perhaps certain to be, mapped into the interaction of β organisms, where it remains
as an inaccessible ‘hazily valued’ β representation until linguistic evolution takes place
and sets of β signs are identified with α signs and become ‘sharp valued’. Before this
occurs, the data may have been modified in its β representation and, if so, we should
say that this identification was insightful. Hence, according to our original criteria,
learning, in this system, entails linguistic evolution.


8.2. Relation between organisations 

In conclusion we should stress the physical characters of this system. Perhaps we
have insisted often enough that any organism is designed to survive as a physical
entity; that its ‘decisions’ are made to maintain its fabric and that the abstraction
θ is only a convenient index for whatever energetic conditions are needed to maintain
and construct its fabric. On these grounds alone the form of organism that evolves
must depend upon the physical assembly that embodies the evolutionary system.
Without this material dependence* our conclusions would be limited to systems which
literally had only an α language and a β language.

But the physical character of the system plays an equally important part in determining
the relations, in particular, the similarities, that exist within the system, and
in maintaining a correspondence between these and the relations that pertain in an
external environment that is built of the same material (MacKay, 1958; Beer, 1961).


* The appropriate abstraction for such a materially dependent system is an infinite automaton, as
discussed, for example, by Loefgren in a recent paper (our evolving organisms are special and rather
intractable cases of infinite automata).

In terms of evolving organisms there are relations between individuals and their
ancestors and between species (the relations required in 4.4. and realised by the
material constraints). Corresponding to these are relations between forms of sign in
a given operational language and between operational languages; physically, in the
α, β case, these are realised as relations between spatial patterns or between frequencies
and relations between spatial patterns and frequencies. Finally, in discourse there are
relations between concepts of the same kind, and concepts of different kinds.

The material dependence of a brain or an artifact suggests that manifestly coherent 
experience stems from a cohesive physical or intellectual environment of which the 
brain is a specialised part. 


APPENDIX 


Comment upon algorithms 

In the discussion Professor Rashevsky suggested that the Markovian theory of 
algorithms could be profitably applied in connection with this learning model. A 
tentative approach will be outlined, since the method seems to be promising. 

We interpret an algorithm as a computing procedure which has conceptual integrity
(that, in some sense, Σ could reveal the algorithms it uses whilst remaining ignorant
of their stages of application, though each stage entails some mechanical change).
Conceived in this way, an algorithm is a process which reduces Σ’s conceptual uncertainty
(given x, y, and x, about the next y).

Algorithms η act upon a vocabulary which is some sequential representation, τ, of
X^α to produce words in τ(X^α) or otherwise to produce words in τ(Y^α). Given (x_0, y_0),
Σ selects some algorithm η ∈ U(φ) where all η ∈ U(φ) either: (i) satisfy the function
f_φ selected, at this instant, from F^α in the sense that η(τ(x_0)) = f_φ(x_0, y_0); x_0 ∈ X^α,
y_0 ∈ Y^α; or (ii) are the first member of a finite sequence of m algorithms


    η_m{η_{m−1}[ ... η_1[τ(x_0)] ... ]} = f_φ(x_0, y_0)

where any

    η_r[x] ∈ τ(X^α ∪ Y^α), for m ≥ r ≥ 1.


‘Conceptual’ uncertainty is envisaged in terms of the average number ν(φ) of algorithms
which are required by Σ, given x_0, y_0, to satisfy f_φ ∈ F^α. In other words, we
interpret the average length of sequence of algorithms that is needed to ‘solve a
problem’ as an index of the uncertainty engendered by ‘posing’ this problem, and
learning is conceived as a process which maintains the rate of reduction of ν
within limits, say, ξ_1 ≤ −Δν/Δt ≤ ξ_2; ξ_2 > ξ_1 > 0 (hence Σ is a self organising
system). Consider the possibility of interaction between this ‘conceptual’ level of index
and state indices at a mechanical level; in particular, that a control is exerted to
stabilise −Δν/Δt by adjustment of these ‘mechanical’ parameters.
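
The bounded rate of reduction of the uncertainty index can be checked numerically. The sketch below assumes the index is sampled at equal intervals; the function names, the sample values and the limits are hypothetical.

```python
def reduction_rates(nu, dt=1.0):
    """Successive values of -delta(nu)/delta(t) for equally spaced samples of nu."""
    return [-(later - earlier) / dt for earlier, later in zip(nu, nu[1:])]

def learning_within_limits(nu, xi1, xi2, dt=1.0):
    """True if every reduction rate lies between the limits xi1 and xi2 (xi2 > xi1 > 0)."""
    return all(xi1 <= rate <= xi2 for rate in reduction_rates(nu, dt))

# Hypothetical samples of the average number of algorithms needed per problem.
nu_samples = [8.0, 7.0, 6.5, 5.5]

print(reduction_rates(nu_samples))
print(learning_within_limits(nu_samples, 0.25, 1.5))
```

A control of the kind suggested in the text would adjust the mechanical parameters whenever this check fails, in either direction.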

We have distinguished certain underlying mechanisms of learning, namely, ‘adaptation’,
‘regional and linguistic evolution’. These occur in a medium or ‘environment’
(such as a brain or artifact) wherein the rate of activity must, on average, remain
constant. Adaptation is a mechanism whereby repeated events ingrain structural
patterns: in α, spatial distributions of synaptic impedances and, in β, more or less
ramifying, frequency sensitive pathways. Given the mechanical particulars, a pair of
indices of the degree of adaptation are calculable; say σ_α and σ_β. If σ_α is high then
φ → φ_0, whilst Σ is embodied in the particular part of the medium (or internal environment)
for which σ_α was calculated.

Regional evolution is a process whereby activity correlations increase the volume
of the medium that, at any instant, is embodying Σ; once again, given the mechanical
particulars, an index of correlation is calculable; say ρ. The higher ρ, the more elaborate
the mechanical concomitants of any ‘conceptual’ event (the application of an algorithm
η).

Now adaptation implies a restriction upon the range of the available algorithms; its
effect is unspecific but, in general, since adaptation implies φ → φ_0 and since ν(φ_1 ∪ φ_0)
> ν(φ_0), the result of adaptation will be a reduction in ν, and increasing the rate of
adaptation will increase −Δν/Δt.

At the conceptual level, decrease in ν(φ) implies a composition of algorithms; the
selection of η*[τ(x)] = η_1[η_2[τ(x)]] in place of η_1 after η_2. Composition increases
the number of members of U that have been used at least once. Let γ(η) be the
number of stages (which do correspond to mechanical events) entailed by the application
of η. Commonly, γ(η*) > γ(η_1) + γ(η_2), as in Rashevsky (1962). We shall
identify the process which provides the mechanical elaboration needed to achieve
composition with regional evolution (notice that, in addition, the regional evolution
also shifts the embodiment of Σ to some other region of the medium with a possibly
lower value of σ_α; hence a homeostasis).

Finally, let us identify linguistic evolution with the augmentation of a vocabulary.

As in Rashevsky (1962), some algorithms, such as those that replicate a word in
sequence, cannot act upon any word in a given vocabulary but require additional and
distinguished signs which take part in the process (although these signs are unchanged
in the final result). Consequently algorithms of this kind act upon an augmented
vocabulary, and Dom(U) increases. (Since the augmenting signs must be necessarily
distinct, mere addition of α signs would not be sufficient.) There is a reasonable and
tempting correspondence between linguistic evolution involving change in the domain
(and/or range) of Σ and the augmentation of a vocabulary from τ(X^α) to τ(X^α ∪ X^β)
and τ(Y^α) to τ(Y^α ∪ Y^β). In this case it is perhaps better to view the appearance of
algorithms of this form as a consequence of the reproductive process which we have
argued is the mechanical backbone of learning.
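
The need for an augmented vocabulary can be seen in a minimal Markov (normal) algorithm interpreter. The sketch below is an illustration, not the construction used in the sources cited: the rule set doubles a word over the one-letter vocabulary {a}, and the auxiliary sign "*" is an assumed marker that lies outside that vocabulary and is absent from the final word.

```python
def run_markov(rules, word, max_steps=1000):
    """Run a Markov normal algorithm.

    rules: ordered list of (lhs, rhs, is_terminal). At each step the first
    applicable rule rewrites the leftmost occurrence of lhs in the word.
    """
    for _ in range(max_steps):
        for lhs, rhs, is_terminal in rules:
            idx = word.find(lhs)
            if idx != -1:
                word = word[:idx] + rhs + word[idx + len(lhs):]
                break
        else:
            return word          # no rule applies: halt
        if is_terminal:
            return word          # a terminating rule was applied: halt
    raise RuntimeError("step limit exceeded")

# Replicating a word over {a} needs the distinguished sign "*".
DOUBLE_RULES = [
    ("*a", "aa*", False),  # move the marker right, doubling each letter it passes
    ("*", "", True),       # marker at the end: erase it and stop
    ("", "*", False),      # no marker yet: introduce one at the left
]

print(run_markov(DOUBLE_RULES, "aaa"))
```

Without the marker, no rule could tell the letters already copied from those still to be copied, which is the sense in which the algorithm acts upon an augmented vocabulary.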


Comments on system representation 

Let us relate the concepts of 3.4. to the formal representation of Rosen (1958) and,
in particular, deal with the special case considered in detail throughout 5.1.-5.6. and
6.1.-6.5.

(1) Suppose a state description of a physical system, Σ, in the scientific language, A,
yields a set of identified states z ∈ Z. These are partitioned into subsets Z_i (equivalence
classes Z/ϱ induced by the partition ϱ) which correspond to subsystems Σ_i ⊂ Σ,
i = 1, 2, ..., n.

(2) From Rosen (1958) a system is determined by a directed graph in which the
directed edges correspond to mappings and the nodes of the graph to the conjunction
of the domain, Dom(F), of some mapping, F, with the product set formed from the
factors of subsets in the range, Ran(F), of other mappings, all Dom(F) and Ran(F)
in Σ being defined.

Thus Σ_i = F_i: Dom(F_i) → Ran(F_i). If Σ is the physical embodiment of a communication
system, Σ_i = F_i: Dom(F_i) → Z_i.

We specify, also, that Z_i is a factor in Dom(F_i), hence:


    Σ_i = F_i: [(Z_j ⊗ ... ⊗ Z_{j+m}) ⊗ Z_i] → Z_i


where the factors are subsets of Z_j, ..., Z_{j+m}; j = 1, 2, ..., n, not i, and
j + m = j, ..., n, not i.

(3) For adaptive subsystems (all subsystems of interest are adaptive) we define a
parameter φ_i, φ_max ≥ φ_i ≥ 1, such that φ_i is an index of f_{φ_i} in a set F_i = ∪ (f_{φ_i}),
when Σ_i = f_{φ_i}: [(Z_j ⊗ ... ⊗ Z_{j+m}) ⊗ Z_i] → Z_i, and Σ_i is adaptive if, given a
stationary sequence [x] ⊂ Dom(F_i) (a redundant condition, since all Dom(F) and Ran(F)
in Σ are defined and are connected with similarly specified subsystems), a convergent
sequence [φ_i] of values of φ_i induces a convergent sequence [z] ⊂ Z_i so that, after
an interval of adaptation, [z] is in a subset of Z_i.

(4) Adaptation is directed. The convergent sequence of values of φ_i is selected to
maximise some variable which we shall identify for the moment with a payoff available
to Σ_i (later we shall give a very different interpretation to this variable).

Formally, let G_i be a function with domain the product set Z_i ⊗ [Z_j ⊗ ... ⊗ Z_{j+m}],
where n ≥ m + 1, which assigns a numerical value to each state in Z_i ⊗ [Z_j ⊗ ... ⊗ Z_{j+m}]
(each element of this set designates the state of Σ_i, immediately defined as some
element of Z_i, in relation to the states of m + 1 other subsystems). Let θ_i denote the
average value of G_i over an arbitrary sequence of changes in state.

Adaptation is defined as any process which selects those values of φ_i which maximise
θ_i. Since we are not concerned with mechanism so much as a model of mechanism, it
is unnecessary to comment upon how this selection is achieved. We take it as a
characteristic of the physical medium in which the system is embodied (i) that selection
of φ_i (or whatever corresponds to our representation of an adaptive parameter)
does occur and (ii) that θ_i maximising selections lead to convergent sequences [φ_i] of
the values of φ_i (hence to an adaptive change).

Finally, we comment that the selective process does entail the idea of ‘chance’,
although only in the weak sense of an ‘independent’ process which decides between
alternative values of φ_i in the absence of sufficient evidence. Consequently, ‘chance’
is subsumed by the characteristics of the physical medium.
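
The definition of adaptation in (4) can be sketched as a search over parameter values. The text deliberately leaves the selection mechanism unspecified, so the exhaustive search below is only one assumed realisation; the transfer function, the payoff and the inputs are hypothetical.

```python
from statistics import mean

def theta(payoff, transfer, phi, inputs):
    """Average payoff G over the states produced when the parameter has value phi."""
    return mean(payoff(transfer(phi, x)) for x in inputs)

def adapt(payoff, transfer, phis, inputs):
    """Select the parameter value that maximises theta (first value wins ties)."""
    return max(phis, key=lambda phi: theta(payoff, transfer, phi, inputs))

# Hypothetical subsystem: the state is (x + phi) mod 3 and only state 0 is rewarded.
def transfer(phi, x):
    return (x + phi) % 3

def payoff(z):
    return 1.0 if z == 0 else 0.0

inputs = [1, 4, 7]          # a stationary sequence of inputs
best_phi = adapt(payoff, transfer, [1, 2, 3], inputs)
print(best_phi, theta(payoff, transfer, best_phi, inputs))
```

The tie-breaking rule in `max` plays the part of the weak notion of ‘chance’ mentioned above: some independent process must decide between parameter values when the evidence does not.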

(5) Now Σ embodies communication systems which will be separated from Σ in a
moment; that is, we shall consider certain information structures needed to maintain
the physical stability of Σ apart from the underlying physically constrained organisation.
But a realistic separation rests upon the fact that, in terms of θ_i maximisation,
which is the basis of our stability criteria, communication is worth while, otherwise
it cannot, physically and consistently, occur, excepting by accident. We thus assert
that G_i is so defined that certain elements of the set Z_i ⊗ (Z_j ⊗ ... ⊗ Z_{j+m}), which
can only be consistently, that is, non accidentally, achieved by correlated action on
the part of several subsystems Σ_j, ..., Σ_{j+m} in conjunction with Σ_i, have higher θ values
for Σ_i than those which can be achieved independently. G_i induces cooperation which
implies interaction and may entail communication.

(6) To derive a communication system from its embodiment in Σ, we perform an
abstraction whereby certain subsets of states are identified with signs. Now we are
concerned with those physical systems, Σ, which, on abstraction, yield at least a pair
of distinct communication systems, A and B, characterised by operational languages
α and β. (In the model of 5.1.-5.6. and 6.1.-6.5. the abstractions define the spatial
pattern and the frequency attributes of physical events.)

Formalise abstraction by a many to one mapping J^α or J^β, when J^α(Z_i) = Y_i^α,
J^β(Z_i) = Y_i^β. Hence a communication subsystem Σ_i^α ∈ A is: Σ_i^α = f_{φ_i}^α: X_i^α ⊗ Y_i^α →
Y_i^α; f_{φ_i}^α ∈ F_i^α = J^α(F_i), and where X_i^α = Y_j^α ⊗ ... ⊗ Y_{j+m}^α (similar definitions
apply in the case of Σ_k^β ∈ B). Finally, as a convention, we include one indifferent, or
ineffective, value in each Y_i^α or each Y_k^β.

The more familiar probabilistic image of a communication subsystem is:


    Σ_i^α = P^α: X_i^α ⊗ Y_i^α → Y_i^α

where P^α is a matrix of empirically estimated joint probabilities of utterances, x, y,
say p(x, y), and the probabilities p(φ_i) = [p(φ_i = 1), p(φ_i = 2), ..., p(φ_i = φ_max)]
may be inferred from these empirical estimates.
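
The empirical estimation of the joint probabilities p(x, y) can be sketched as a frequency count. The helper names and the observed utterances below are hypothetical, and the sketch stops at the joint and conditional tables; it does not attempt the further inference of p(φ_i).

```python
from collections import Counter

def joint_probabilities(utterances):
    """Relative frequency estimate of p(x, y) from observed (x, y) utterances."""
    counts = Counter(utterances)
    total = len(utterances)
    return {pair: n / total for pair, n in counts.items()}

def conditional(joint, x):
    """p(y | x) derived from the joint table."""
    p_x = sum(p for (xi, _), p in joint.items() if xi == x)
    return {y: p / p_x for (xi, y), p in joint.items() if xi == x}

# Hypothetical record of utterances in one communication subsystem.
observed = [("x1", "y1"), ("x1", "y1"), ("x1", "y2"), ("x2", "y2")]

P = joint_probabilities(observed)
print(P)
print(conditional(P, "x1"))
```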

(7) By previous definition communication in A is possible if and only if the directed
graph of A, namely of J^α(Σ), has cyclic subgraphs; or in B if and only if the directed
graph of B, namely J^β(Σ), has cyclic subgraphs. An alternative condition is that Σ_i^α
communicates if and only if:


Yi3 Mic Y;; xX" Go as SN re ak Yr, where either (1) r = 7, or (11) x. Cc Y; 
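
The graph condition in (7) amounts to a test for a directed cycle. A minimal sketch follows, assuming the system graph is given as an adjacency mapping from each subsystem to the subsystems it feeds; the node names are hypothetical.

```python
def has_cycle(graph):
    """Depth-first search for a directed cycle; graph maps node -> successors."""
    WHITE, GREY, BLACK = 0, 1, 2          # unvisited, on current path, finished
    colour = {node: WHITE for node in graph}

    def visit(node):
        colour[node] = GREY
        for nxt in graph.get(node, ()):
            state = colour.get(nxt, WHITE)
            if state == GREY:
                return True               # edge back into the current path
            if state == WHITE and visit(nxt):
                return True
        colour[node] = BLACK
        return False

    return any(colour[node] == WHITE and visit(node) for node in graph)

# Hypothetical system graphs: a chain cannot communicate, a loop can.
chain = {"S1": ["S2"], "S2": ["S3"], "S3": []}
loop = {"S1": ["S2"], "S2": ["S3"], "S3": ["S1"]}

print(has_cycle(chain), has_cycle(loop))
```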


(8) The α vocabulary is X^α ∪ Y^α; X^α = ∪_i (X_i^α), Y^α = ∪_i (Y_i^α); the β vocabulary is
X^β ∪ Y^β; X^β = ∪_k (X_k^β), Y^β = ∪_k (Y_k^β). The possible utterances in α are product pairs
x, y ∈ X^α ⊗ Y^α and in β product pairs x, y ∈ X^β ⊗ Y^β, but those which are achievable
depend upon the functions f_{φ_i}^α and f_{φ_k}^β which define the communication subsystems.
Any utterance determines a many to one relation between subsystems Σ_j^α, ..., Σ_{j+m}^α and
some particular subsystem Σ_i^α ∈ A, the character of which depends upon the form of
system graph and the distribution of the values of the φ_i (for the domain of a subsystem
is a product set, its range is a factor in a product set). A similar argument
applies to Σ_k^β ∈ B.

Most utterances have only a local connotation. 

(9) Legitimate utterances x, y = u ∈ U^α and x, y = u ∈ U^β (where U^α and U^β are
sets of legitimate α and β utterances) ‘mean the same thing to’ all subsystems Σ_i^α ∈ A
(or all Σ_k^β ∈ B).

To define ‘meaning the same thing’ we must refer to the physical embodiment of
the communication system; for perhaps the least non trivial definition, that x ‘means
the same thing to’ Σ_i^α as it does to Σ_j^α, is that, in the same conditions, receipt of x
induces the ‘same state’ in Σ_i^α that it does in Σ_j^α.

(10) Let z_0 be a state in Z and let ε be some relation of equivalence in Z. Thus if
z_0 ε z_1, then z_0, z_1 are equivalent states and the equivalence class of z_0 with reference
to ε is denoted as z_0/ε. Choice of a particular relation ε, though crucial in our
argument, is not determined in the formal representation; except in the trivial sense
that the conceptual framework must not be contradicted by the chosen relation. In
particular, if z_0 ε z_1 are regarded as the ‘same states’, signs that ‘mean the same thing’
must be able to exist.

(11) Consider a pair x_0 ∈ X^α, x_1 ∈ X^α, with x_0 ∈ Dom(f_{φ_i}^α) ⊂ X_i^α and similarly with
x_1 ∈ Dom(f_{φ_j}^α) ⊂ X_j^α. Let x_0 π x_1 imply that x_0 ‘means the same thing as’ x_1 if and
only if, for each Σ_i^α ∈ A and Σ_j^α ∈ A and for the same z_0 ∈ Z, it is true that:


(i) when (y ∈ Y_i^α) = (y ∈ Y_j^α) = y, and
(ii) when φ_i = φ_j = φ_0, φ_0 a value of φ, f_{φ_0 i}^α(x_0, y) ∈ z_0/ε ∩ Z_i and f_{φ_0 j}^α(x_1, y) ∈ z_0/ε ∩ Z_j.


(12) We apply the plausible constraint that J^α maps z_0/ε ∩ Z_i onto a single member
y_a of Y_i^α (which may be the indifferent element); similarly, that J^β maps z_0/ε ∩ Z_k
onto a single member y_b of Y_k^β.

(13) In this case x_0/π is a sign with the common meaning of the set z_0/ε. Any
x_a ∈ x_0/π, x_b ∈ x_0/π, may be combined with y_a (if x_a ∈ X^α also) to yield a legitimate
utterance x_a, y_a = u_a ∈ U^α and, if x_b ∈ X^β, to yield a legitimate utterance x_b, y_b =
u_b ∈ U^β. By ‘combined’ we intend for x_a ‘presented’ (given y_a selected by Σ_i^α and a
suitable value of φ_i) to Σ_i^α (or, in the case of the x_b, the same comment with reference
to Σ_k^β and φ_k rather than Σ_i^α and φ_i).

(14) Sets of signs are informative if there exist equivalence classes like z_0/ε, z_1/ε, ...
and values of φ_i like φ_0, φ_1, ... such that if φ_i = φ_0, then f_{φ_0 i}^α(x_0, y) ∈ z_0/ε ∩ Z_i,
implying y_0 ∈ Y_i^α, and if φ_i = φ_1, then instead f_{φ_1 i}^α(x_0, y) ∈ z_1/ε ∩ Z_i, implying y_1 ∈ Y_i^α;
since, in this case, the selection of y ∈ Y_i^α indicates the adaptation of Σ_i.

(15) We are now in a position to specify the composition of the sets of legitimate
utterances in the operational languages α and β, namely the sets of product pairs U^α
and U^β, which are defined as being disjoint, U^α ∩ U^β = ∅.

Let X/π be the set of equivalence classes x_r/π.

Let Z/ε be the set of equivalence classes z_r/ε.

U= 0 {u fx/2 0 LU (Dom (Fr) J} ® yr} 
xin 2, /E E%c4 
in in ‘ 
Xin ZIE 
Similarly for U^β, if Σ_i^α ∈ A is replaced by Σ_k^β ∈ B and if Dom(F_i^α) is replaced by
Dom(F_k^β).

(16) Coherent discourse in α or β (incidentally, also informative discourse in α or β)
consists of sequences of legitimate utterances (this is not, of course, the only discourse
in α or β; for some values of φ_i, Σ_i^α will, given x_0, produce y such that x_0, y is not in
U^α, and similarly for φ_k, Σ_k^β and signs not in U^β). But insofar as α and β are operational
languages a coherent universe of discourse must exist and communication
within it is characterised by sequences [u_α] ⊂ U^α and [u_β] ⊂ U^β.

(17) The factors of the x in a coherent sequence of discourse [u_α] = x_1 → y_1 →
x_2 → y_2 → ... must be y_r ∈ J^α(z_r/ε). For since the communication is coherent any
selection of y from Y_i^α (or, for Σ_k^β, from Y_k^β) must correspond to some y_r ∈ J^α(z_r/ε)
[or, in the case of one of the B subsystems, to y_r ∈ J^β(z_r/ε)]. But X^α and X^β are
product sets with factors Y_i^α and Y_k^β, and the only values of these factors which are
achieved in coherent discourse are signs or components of signs y = y_r, as above.

(18) Hence y = J^α(z_r/ε ∩ Z_i), y ∈ Y_i^α, for some r; y = J^β(z_r/ε ∩ Z_k), y ∈ Y_k^β,
for some r. Further, if in the directed graph subsystems Σ_j^α and Σ_m^α give an input to
subsystem Σ_i^α, for x ∈ X_i^α, x = J^α(z_a ⊗ ... ⊗ z_b), where the state z_a ∈ z_a/ε ∩ Z_j
and state z_b ∈ z_b/ε ∩ Z_m. Thus, using similar arguments for B, we arrive at the
structure indicated in Fig. 9 below.

The physical subsystems Σ_i, with behaviours that generate the discourse, are adaptive
in the sense of (3) and (4): the values of φ converge to a definite value φ_0. Regional
evolution entails the stable ‘cooperative’ coupling of these subsystems (in the sense
that the measure of Dom f_{φ_0 i} ∩ Dom f_{φ_0 j} increases as a result of adaptation) and it is
denoted as ‘Σ_i, Σ_j → Σ_i ∪ Σ_j’.

Notice that the utterances of Σ_i^α are no longer distinct, in A, from the utterances
of Σ_j^α and become ‘joint utterances’. In order to effect separation it would be necessary
to discover and attach the name ‘i’ or ‘j’ as a parameter of each utterance.

Condition (15) and condition (7) imply that in order to preserve the relation
Σ_i^α = J^α(Σ_i) between Σ_i^α and Σ_i, as well as U^α ∩ U^β = ∅, it is necessary to choose ϱ
and ε so that no cyclic subgraph of the system that includes Σ_i such that J^α(Σ_i) =
Σ_i^α ∈ A also includes Σ_k such that J^β(Σ_k) = Σ_k^β ∈ B. Hence Σ_i^α ∪ Σ_k^β is a prohibited
coupling.


[Fig. 9. Diagram relating, over the times t_0, t_0 + Δt, t_0 + 2Δt, t_0 + 3Δt, t_0 + 4Δt, the
α discourse [u_α] = y_0 → x_1 → y_1 → x_2 → y_2, the α abstraction, the state description,
and the physical embodiment of Σ_i, which at any time induces a particular sample of
discourse and is connected to other subsystems.]
(19) There are thus several ways of discussing the system. The most comprehensive 
refers to the physical embodiment. Here the relations $\sigma$ and $\varepsilon$ are preserved and the 
abstractions $J^\alpha$ and $J^\beta$ have a definite form; in the case considered, 'frequency' and 
spatial pattern attributes of physical events. Next there are the communication systems 
derived by $J^\alpha$ and $J^\beta$, which are related through the equivalence classes induced by $\varepsilon$. 
Finally, there is the discussion of discourse characterised by a relation $\pi$ induced by $\varepsilon$; 
and, speaking in these terms, it may be impossible to preserve the identification between 
communication systems $\Xi_i$, $\Xi_j$ and the originally chosen physical subsystems 
$\Sigma_i$, $\Sigma_j$. 

(20) To preserve the integrity of $\alpha$ and $\beta$ we need a condition like $U^\alpha \cap U^\beta = 0$. 
On the other hand, the possibility of translation depends upon $\alpha\beta$ and $\beta\alpha$ interaction, 
which is formally represented by a many-to-one mapping $\Omega_l$ (with parameter $l$ such 
that if $X_r \subset X^\beta$ then possibly $\Omega_l(X_r) \subset \mathrm{Dom}(F_i)$, and vice versa for $\alpha$ to $\beta$). But in 
order to preserve the $\alpha\beta$ distinction, if $\Omega_l(X_r) \subset \mathrm{Dom}(F_i)$ then it is not true that 
$\Omega_l(X_r) \mathrel{\pi} x$ for any $x \in X^\alpha$. Hence the subsystems we have considered cannot translate 
coherent discourse. 

(21) Translation, in fact, entails the construction of a different logical type of 
statement; hence from the viewpoint of an observer an uncertainty regarding type 
(some care is needed in the interpretation of this remark); for manifestly different 
types of statement are involved in a discussion of the structure in (19). But translation 
entails the development of higher orders of function. Hence an observer’s propositions 
about the translation are propositions of different and higher type. (In particular, his 
linguistic interpretation of the system leads him to discard his definition of subsystems 
as $\sigma$ localised entities.) 

(22) A formal representation of what occurs when a subsystem (at least of the kind 
we have considered) translates a coherent sequence of discourse leads us to reinterpret 
our idea of a subsystem or equivalently of the function it computes (the trick involved 
is that the basic distinction between the subsystems $\Sigma_i$ rests upon consideration of 
the entire structure in Fig. 9 and # is chosen to satisfy this distinction). So far we 
have been concerned with utterances $U^\alpha$ and $U^\beta$, determined by subsystems 
of $\alpha$ and of $\beta$ that are characterised by Order (I) functions $F^\alpha$ and $F^\beta$. 
Now consider the construction of Order (II) (or of higher order) functions determining 
'utterances' in $U^\alpha \otimes U^\alpha$, $U^\beta \otimes U^\beta$, or, since this is not prohibited, $U^\alpha \otimes U^\beta$. 

Denote: (I) $x_a$ in $u_a \in U^\alpha$, $x_b$ in $u_b \in U^\beta$, 

(II) $\xi_a = (x_a, x_a)$ in $(u_a, u_a) \in U^\alpha \otimes U^\alpha$ 
$\xi_b = (x_b, x_b)$ in $(u_b, u_b) \in U^\beta \otimes U^\beta$ 
$\xi_c = (x_a, x_b)$ in $(u_a, u_b) \in U^\alpha \otimes U^\beta$ 

If $\pi$ can be interpreted at Order (II) (the assumption that it can is the 'similarity' 
condition of 3.4.) then none of these relations are prohibited: 

(1) $x_a \mathrel{\pi} \xi_a$, $x_b \mathrel{\pi} \xi_b$ (which are not translations), or 

(2) $\xi_a \mathrel{\pi} \xi_c$, $\xi_b \mathrel{\pi} \xi_c$, hence $\xi_a \mathrel{\pi} \xi_b$, whence, from (1), $x_a \mathrel{\pi} x_b$, which implies the 
possibility of translation. Now to say '$x_a \mathrel{\pi} x_b$' does contradict the original constraints 
imposed upon the system if we interpret this as a proposition of Type I (referring to the 
Order (I) functions), but it does not contradict these constraints if it is interpreted as a 
Type II proposition (in other words if 'mean the same thing as' is now conceived as a relation 
involving Order (II) functions and the subsystems, whatever these may be, that they 
characterise). Alternatively, we can define a new relation $\pi^*$ such that '$x_a \mathrel{\pi^*} x_b$' is a 
Type (I) proposition that is true if and only if the Type (II) proposition '$x_a \mathrel{\pi} x_b$' is 
true. 
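
The step from single utterances to pairs of utterances in (22) can be pictured with a 
minimal sketch. It assumes, purely for illustration, that the two sets of utterances are 
finite sets of labels and that the pairing is an ordinary Cartesian product; the labels 
and variable names are hypothetical and are not part of the model. 

```python
from itertools import product

# Hypothetical, finite stand-ins for the two sets of utterances.
U_alpha = {"a1", "a2"}
U_beta = {"b1", "b2"}

# At the level of single utterances the two sets share nothing.
shared = U_alpha & U_beta

# At the level of pairs, three kinds of pair can be formed.
pairs_aa = set(product(U_alpha, U_alpha))  # both members from alpha
pairs_bb = set(product(U_beta, U_beta))    # both members from beta
pairs_ab = set(product(U_alpha, U_beta))   # one member from each

print("shared single utterances:", shared)
print("number of mixed pairs:", len(pairs_ab))
```

The sketch shows only that keeping the two sets disjoint does not prevent mixed pairs 
from being formed; it says nothing about which relations hold between them. 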

(23) From (18), regional evolution implies $\Sigma_i, \Sigma_j \to \Sigma_i \cup \Sigma_j$. It can be argued that 
$\Sigma_i \cup \Sigma_j$ has a behaviour (and that the corresponding, coupled, communication 
system generates a discourse) that is determined by a function of higher order than 
the Order I functions $f_i$ and $f_j$ that are members of its domain (equivalently, the 
coupling introduces further axioms into the system since the graph is redefined). 
Linguistic evolution, entailing the 'translation' or 'identification' of signs, should 
involve couplings like $\Sigma_i \cup \Sigma_k$, but these are prohibited by (18). However, the 
coupling '$(\Sigma_i \cup \Sigma_j) \cup (\Sigma_k \cup \Sigma_l)$', which is interpreted as a stable, cooperative 
association between the coupled systems $\Sigma_i \cup \Sigma_j$ and $\Sigma_k \cup \Sigma_l$, is not prohibited. 
Notice, however, that as in (18) the utterances of $\Xi_i$ are inseparable from those of 
$\Xi_j$, or those of $\Xi_k$ from $\Xi_l$. Hence, if a mapping, $\Omega$, of utterances is described in the 
original descriptive framework, in A, it must carry the names 'i', 'j' as a parameter. 

(24) Consider, in particular, the special case when a coherent sequence of a discourse 
$u^\alpha$ describes a subsystem (since subsystems are identified with functions, it is 
sufficient that $u^\alpha$ exhibits the complete behaviour of a subsystem). Then a translation 
$u^\alpha \to u^\beta$ is a reproduction of the subsystem concerned (notice (1) that this is the 
case we chiefly examined in 5.1.-5.6. and 6.1.-6.5., and (2) the close relation between 
this case and Rosen's paradox (Rosen, 1959)). In order that an $\alpha$ subsystem reproduces, 
it is necessary to invoke a distinct subsystem, here the $\beta$ image of the $\alpha$ subsystem, 
and, in order to speak about the process, to invoke an idea of logical type. In Rosen 
(1959) the logical type of statements about the system and its distinct environment is 
higher than the original type, of statements about the impossible self-reproducing 
system. In the present case the physical correspondent of an $\alpha$-system is reproduced 
in the 'environment' provided by the physical correspondent of a $\beta$-system and vice 
versa. In terms of $\alpha$- or $\beta$-discourse this entails 'translation'. 

(25) Several physical embodiments may be possible; we have considered only a 
particular evolutionary system, the subsystems of which are characterised by a 
combination of the $\Sigma_i$, represented, formally, as '$\Sigma_i \cup \Sigma_j$'. 

(26) Now any communicating group of the $\Sigma_i$ is trivially such a combination. However, 
we adopt the reasonable convention of writing '$\Sigma_i \cup \Sigma_j$' if and only if the 
correlation between the states of $\Sigma_i$ and certain other states of $\Sigma_j$ is 1, that is, such that 
the interaction between the designated subsystems is not reasonably construed as 
communication; for no doubt is resolved as a result of it. 
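
The convention in (26) can be illustrated with a minimal sketch. It assumes, purely 
for illustration, that the states of two subsystems are recorded as equal-length numeric 
sequences and that 'correlation' means the Pearson coefficient; the function names, the 
tolerance and the sample data are hypothetical and do not come from the model. 

```python
from math import sqrt

def pearson(xs, ys):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    if vx == 0 or vy == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / sqrt(vx * vy)

def is_combination(states_i, states_j, tol=1e-9):
    """True when the recorded states are correlated with coefficient 1 (within tol)."""
    return abs(pearson(states_i, states_j) - 1.0) <= tol

# Hypothetical state records: the first pair moves in lockstep, the second does not.
locked = is_combination([0, 1, 0, 1, 1], [0, 1, 0, 1, 1])
loose = is_combination([0, 1, 0, 1, 1], [1, 1, 0, 0, 1])
print(locked, loose)
```

In the locked case nothing is learned about one record from the other that was not 
already fixed, which is the sense in which no doubt is resolved by the interaction. 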

(27) To achieve $\Sigma_i \cup \Sigma_j$ the adaptation rules are modified as indicated in 3.4. and 
later illustrated in 5.1.-5.6. and 6.1.-6.5. 

(i) Adaptation is locally irreversible (that is, with respect to $\sigma$ and the distinction $Z_i$ 
and $Z_j$). 

(ii) $\theta_i = \theta(\Sigma_i)$ is an index of survival; we assert that the physical medium that 
embodies $\Sigma_i$ must be maintained at a cost that is measured in terms of $\theta(\Sigma_i)$. Hence 
the $\theta(\Sigma_i)$ maximising criterion is a criterion of survival, say $\theta(\Sigma_i) > \theta_0$, for survival. 

(iii) The $\Sigma_i$ are so defined that $\theta$ is superadditive, thus: $\theta(\Sigma_i \cup \Sigma_j) > \theta(\Sigma_i) + \theta(\Sigma_j)$. 

(iv) To represent survival, the directed graph of $\Sigma_i$ is defined if and only if the 
physical embodiment of $\Sigma_i$ survives, namely, if and only if $\theta(\Sigma_i) > \theta_0$. 
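
Rules (ii) to (iv) can be made concrete with a small numerical sketch. The form chosen 
for the index of survival (a sum of individual values plus a fixed positive bonus for 
every coupled pair) is an assumption made only to obtain a superadditive function; the 
names `theta`, `survives`, `THETA_0` and all numbers are hypothetical. 

```python
THETA_0 = 1.0  # hypothetical survival threshold

def theta(members, base, bonus):
    """Index of survival for a set of coupled subsystems.

    base maps each subsystem name to its value when alone; bonus (> 0) is
    added once for every coupled pair. Both are illustrative assumptions.
    """
    n = len(members)
    pair_count = n * (n - 1) // 2
    return sum(base[m] for m in members) + bonus * pair_count

def survives(members, base, bonus, theta_0=THETA_0):
    """A subsystem, or coupling, survives only if its index exceeds theta_0."""
    return theta(members, base, bonus) > theta_0

base = {"i": 0.6, "j": 0.5}
alone_i = theta({"i"}, base, bonus=0.25)
alone_j = theta({"j"}, base, bonus=0.25)
joint = theta({"i", "j"}, base, bonus=0.25)

print("superadditive:", joint > alone_i + alone_j)
print("i alone survives:", survives({"i"}, base, 0.25))
print("coupling survives:", survives({"i", "j"}, base, 0.25))
```

With these numbers neither subsystem passes the threshold alone but the coupling does, 
which is the kind of pressure towards aggregation that superadditivity introduces. 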

(28) Finally, a replacement rule is needed to avoid the depletion and, ultimately, the 
disappearance of subsystems. In some pertinent sense the domain and the range of 
potentially definable sets of functions are created to maintain an average population 
of n subsystems, which differentiate and, since $\theta$ is superadditive, also aggregate by 
adaptation. 
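
A replacement rule of this kind might be sketched as follows. The sketch simplifies the 
text in two ways that should be read as assumptions: the population is topped up to 
exactly n after every step, rather than held at n on average, and each subsystem is 
reduced to a single number standing for its index of survival. The function name and 
the way new values are drawn are hypothetical. 

```python
import random

def replace_step(population, theta_0, n, new_theta):
    """Drop subsystems whose index is not above theta_0, then top up to n."""
    survivors = [t for t in population if t > theta_0]
    while len(survivors) < n:
        survivors.append(new_theta())  # a freshly created subsystem
    return survivors

rng = random.Random(0)  # fixed seed so that the run is repeatable
population = [0.2, 1.4, 0.9, 2.0]
after = replace_step(
    population,
    theta_0=1.0,
    n=4,
    new_theta=lambda: 1.0 + rng.random(),  # assumed starting value of a new subsystem
)
print(len(after), after[:2])
```

The two subsystems above the threshold are kept and two new ones are created, so the 
population does not dwindle as subsystems fail to survive. 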


SUMMARY 


We have considered a cybernetic model in which learning and cognition appear as 
components of an evolutionary process in a self organizing system. The model 
illuminates the role of 'distributed' or 'non-localized' functions and it possesses an 
interesting asymmetry which seems to account for some of the peculiar discontinuities 
in behaviour associated with 'attention' and 'insight'. 

In order to discuss the activity in a self organizing system it is necessary to adopt 
a descriptive framework applicable in the case when we (acting as observers or 
experimenters) suffer a structural uncertainty, which we do (though this is often 
forgotten) in any discussion of intellect. To illustrate the point, we cannot, as an 
engineer can, define the representation of a message with complete certainty. But this 
does not mean that the model is unconstructable. Indeed, a functioning physical realization 
of the model exists. Until a few months ago the most likely correlate in the brain for 
the evolutionary process in this cybernetic model seemed to be a changing mode of 
activity in resonant neurone circuits. Whilst this interpretation is still plausible, it 
leads us to look for further physical mechanisms with which the resonant circuits 
interact. Some recent work from other countries indicates a number of physiological 
mechanisms acting at a macromolecular level that may embody the required 
interactions. 